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Abstract 

This is the summary and introduction to the proceedings contributions for the 
Les Houches 2009 "Tools and Monte Carlo" working group. 
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1. FOREWORD 

The working group on "Tools" and Monte Carlos for TeV-scale physics held discussions throughout the 
two-week period of the Les Houches meetings. The topics covered herein span both sessions. Several of 
the topics followed on from those discussed in the "Standard Model Handles and Candles" session of the 
previous workshop [1]; there has been substantial progress and several new topics were introduced. 

The contributions here in fact include substantial physics results derived from the programmes, in- 
terfaces and techniques discussed, as well as status reports on existing projects and proposals for new 
standards and interfaces. 

In Part I we have collected the more technical proposals for common standards and interfaces, mostly 
required because of the rapid progress in higher order calculations. Part II collects results on the tuning 
of MC simulations, a critical topic for understanding LHC data. Part III contains several contributions 
comparing various all-order calculations with fixed-order results, with and without matching between 
the two. Part IV reflects some key issues on the communicability of results between experiment and 
theory. Part V discusses recent progress and ideas in using jets and jet substructure at the LHC to study 
QCD and search for new physics, and finally Part VI discusses progress in some key modeling tools for 
beyond-the-standard-model physics. 

The productivity and pleasure of this workshop is overshadowed by the dreadful loss of our friend and 
colleague Thomas Binoth, and we dedicate these proceedings to him. 
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Parti 

INTERFACES 

2. A STANDARD FORMAT FOR LES HOUCHES EVENT FILES, VERSION 2 

Q 

2.1 INTRODUCTION 

The Les Houches Accord (LHA) for user-defined processes I2j has been immensely successful. It is 
routinely used to pass information from matrix-element-based generators (MEGs) to general-purpose 
ones (here, somewhat unfairly referred to as parton shower generators — PSGs), in order to generate 
complete events for a multitude of processes. The original standard was in terms of two Fortran common 
blocks where information could be stored, while the actual usage has tended to be mainly in terms of files 
with parton-level events. For this purpose a new accord — the Les Houches Event File (LHEF) accord (3) 
— was introduced in 2006, which standardized the way such event files should be structured. 

The LHEF was constructed using XML tags in order to make it flexible and easy to extend (although 
some additional structure is assumed inside some tags which is not formulated in XML). The format has 
been extremely useful, and has basically become the standard way to interface matrix element generators 
and parton shower programs. 

As the matching and merging of tree-level matrix elements and parton showers are now becoming the 
state-of-art, it is reasonable to let this be reflected in an updated file format to standardize how relevant 
information should be given by the matrix element generators in a usable fashion for the parton shower 
programs. Furthermore, with the matching of next-to-leading order (NLO) calculations and parton showers 
becoming increasingly widespread, it is worth considering how the LHEF can be augmented in order to 
facilitate this. 

For the CKKW-type merging algorithms f^J5\ it is convenient to allow the Sudakov-reweighting to be 
done in the MEG, as this will automatically regularize soft- and collinear divergencies. Hence it would be 
desirable if the LHEF could include information about this. This does not only mean that a weight needs 
to be added, but also information about which cuts has been imposed in the MEG as well as information 
on how the generated event was clustered to obtain the relevant scales and Sudakov form factors. 

In the case the events are produced by a NLO MEG, the situation is a bit more complicated. Here 
a subtraction scheme is typically used to handle the cancellation between real and virtual corrections. 
This means that, besides loop-level events, each tree-level real event with one extra parton will need to 
be supplemented by counter events corresponding to the assumed projections of the tree-level event to 
born-level events with one parton less. To allow for matching or merging with a PSG, these events need to 
be considered together in a group of events, something that was not forseen in the original file format. 

Independent of these ME-PS matching considerations, we also wish to introduce some further, minor, 
additions to assist the determination of errors arising from the the parton density function (PDF) parame- 
terizations used in the MEG. Normally these error estimates are given as a set of different PDFs where 
the parameters have been varied around the best fit value. Hence, a given event may be associated with 
several weights corresponding to the different PDFs used. 

Note that the scope of the format suggested here is somewhat different from the HepML schema Q 
(used by eg. the MCDB project f7|). The LHEF format is specialized in the interface between matrix 
element generators and parton shower programs, while HepML is intended to give more general meta- 
information on how events have been produced by an event generator. However, there is nothing that 

'Contributed by L. Lonnblad Email leif . lonnbladSthep .lu.se J. Alwall, S. Belov, L. Dudko, L. Garren, K. Hamil- 
ton, J. Huston, D. Kekelidze, E. Maina, F. Maltoni, M. Mangano, R. Pittau, S. Platzer, A. Sherstnev, T. Sjostrand, P. Skands. 
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<LesHouches Events version="l . 0"> 
< ! — 

# optional information in completely free format, 

# except for the reserved end tag (see next line) 
— > 

<header> 

<! — individually designed XML tags, in fancy XML style — > 
</header> 
<init> 

compulsory initialization information 

# optional initialization information 
</init> 

<event> 

compulsory event information 

# optional event information 
</ event> 

{further <event> ... </event> blocks, one for each event) 
</LesHouchesEvents> 



Fig. 1: The original structure of a Les Houches event file. 



prevents the LHEF format to be included in the HepML structure in the future. 



The outline of this article is as follows. In section 2.2 we review the structure of the original LHEF 



accord and of the Les Houches common block structure on which it is based. Then in section 12.31 we 
present the additional XML tags which may be used to specify additional global, and per-event information. 
Finally we give a brief summary and outlook. 



2.2 THE ORIGINAL EVENT FILE FORMAT AND COMMON BLOCK STRUCTURE 

The first version of the Les Houches event file format was a simple structure specifying how to write the 
Les Houches common blocks to a text file. A few XML tags were defined to simplify parsing but not 
much more than the information in the common blocks was formalized. The structure of a file is outlined 
in figure [T] where the tags are as follows. 

• LesHouchesEvents: which contains the whole file and which mandates a version attribute 
set to "1 .0". 

• header: which may contain any number of unspecified XML tags describing how the events were 
generated. 

• init: This is the tag which specifies the information in the HEPRUP common block. The start tag 
must be alone on a line and the following line must contain the information which is in common for 
all processes in the file. The lines following this must contain the per-process information from the 
common block, one process per line. If there are any other lines before the end tag, they must be 
preceded by a #-sign (c.f. figure[2]l. 

• event: The init tag may be followed by any number of event tags, one for each event 
generated. Also the event start tag must be alone on a line and the following line must contain the 
general event information from the HEPEUP common block. The lines following this must contain 
the per-particle information, one line per particle. Also here additional lines may be included before 
the end tag if they are preceded by a #-sign. (c.f. figure|3]l. 

• Before the init tag one may, optionally, include arbitrary text enclosed in XML comment tags, 
< ! — ... — >, but no other text is allowed in the enclosing LesHouchesEvents tag. 

For a more detailed description of the LHEF format we refer to |[3|. 
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<init> 

IDBMUP(l) IDBMUP(2) EBMUP ( 1 ) EBMUP(2) PDFGUP(l) PDFSUP(l) PDFSUP(2) IDWTUP NPRUP 
XSECUP ( 1 ) XERRUP ( 1 ) XMAXUP ( 1 ) LPRUP ( 1 ) 
XSECUP(2) XERRUP (2) XMAXUP (2) LPRUP (2) 

XSECUP (NPRUP) XERRUP (NPRUP) XMAXUP (NPRUP ) LPRUP (NPRUP) 

# Additional 

# information 
</init> 



Fig. 2: The structure of the init tag in the original LHEF format. See ||2) for the meaning of the different 
common block variables. 



<event > 






























NUP IDPRUP XWGTUP SCALUP AQEDUP AQCDUP 






























IDUP(l) ISTUP(l) M0THUP(1,1) M0THUP(2,1) 


ICOLUP (1 


1) 


ICOLUP (2 


1) 


PUP (1 


1) 


PUP (2 


1) 


PUP (3 


1) 


PUP (4 


1) 


PUP (5 


1) 


IDUP(2) ISTUP(2) H0THUP(1,2) MOTHUP(2,2) 


ICOLUP (1 


2) 


ICOLUP (2 


2) 


PUP (1 


2) 


PUP (2 


2) 


PUP (3 


2) 


PUP (4 


2) 


PUP (5 


2) 


# In total 1+NUP lines after the <event> 


tag 




























# Additional 






























# information 






























</event> 































I — I 

Fig. 3: The structure of the event tag in the original LHEF format. See \2j for the meaning of the 
different common block variables. 

2.3 THE NEW FILE FORMAT 

We now describe our suggestion for an updated file format which includes the additional information 
mentioned in the introduction. All such information is encoded in XML tags with optional attributes given 
in the usual way: 

<tag attributel="value " attribute2="value">content</tag> 
or, for a tag without content, 

<tag attributel="value " attribute2="value " attribute3="value" /> 

The new tags can either be given in the init block, should they refer to the whole file, or in the event 
block, if they only refer to an individual event. In addition group tags can be inserted to group events 
together. 

2. 31 GLOBAL INFORMATION 

The following tags may be included inside the init tag and contain additional global information about 
how the events in the file were produced. They must be placed after the mandatory lines containing 
HEPRUP common block information (see figure|2]), but otherwise the order is unimportant. Only tags 
which are not marked optional below need to be supplied. 

The generator tag (optional) This is just added to give easy access to the name of the program which 
has generated the file. The content of the tag is simply the name and the only allowed attribute is 

- version: a string describing the version of the generator used. 

The xsecinfo tag (required) The information in the HEPRUP common block is in principle sufficient 
to determine the cross sections of the processes involved. Currently, the way in which this information 
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is specified is a bit complicated and sometimes confusing, since it was assumed to be used to pass 
information between the MEG and PSG in both directions. For the event file, the communication is per 
definition one-way, and the information can be made more easily accessible. The tag itself has no content, 
and the information is given in the following attributes. 

- neve (R^ the number of event^in the file. 

- totxsec (R): the total cross section (in units of pb) of all processes in the file. 

- maxweight (D=lQ the maximum weight of any evenj^in the file (in an arbitrary unit). 

- minweight (D=-maxweight): if the file contains negative weights, the minweight is the 
most negative of the negative weights in the file. (Must obviously be the same unit as maxweight.) 

- meanweight (D=l): The average weight of the events in the file (same unit as maxweight). 

- negweights (D=no): If yes, then the file may contain negative weights. 

- varweights (D=no): If yes, then the file may contain varying event weights. If no, all events 
are weighted with maxweight (or, if negweights=yes, with minweight). 

- eventgroups (D=:no): If yes, the events in the file may be grouped together with group tags, 
in which case the attributes above count an event group as one event rather than several separate 
ones. 

- maxingroupweight (D=maxweight): If eventgroups=yes, this gives the maximum 
weight among the events inside groups. 

- miningroupweight (D=-maxingroupweight): If eventgroups=yes, this gives the 
minimum weight among the events inside groups. 

Note that it is assumed that all processes in the file are weighted with respect to a common total 
cross section, such that summing the weights for the events of a given process and multiplying with 
totxsec/maxweight/neve will give the cross section for that process. In this way, the per-process 
information in the HEPRUP common block can be safely ignored. 



The cutsinfo tag (optional) This tag is used to supply information about which kinematical cuts were 
used to generate the events in the file. Several different cuts can be given with cut tags and it is possible 
to specify which particles should be affected by each cut using ptype tags. 

The cut tag contains an actual cut made in the generation of the events. In general, all events in the file 
will pass this cut. The cut is defined in terms of a kinematical variable and the particles which are affected. 
The content of the tag is one or two numbers giving the allowed range of value of the kinematical variable 
according to the attribute limit (see below). 

The variable is defined according to the following attributes of the cut tag: 

- pi (D=0): Lists the particle types for which this cut applies. This can be either a number, 
corresponding to a given particle PDG [8| code, or a string corresponding to a group of particles 
previously defined with a ptype tag (see below). The default is zero which means any particle 
type. 

- p2, . . . , p9: Allows the specification of additional sets of particle types, by analogy to pi, in order 
to facilitate the application of different classes of cuts to different classes of particles. 

- type (R): This defines the variable which is cut. The following values are predefined, but also 
other variables may be specified. (Where relevant, the laboratory frame is assumed, and all energy 

units are in GeV.) 

^(R) means the attribute is mandatory 



^Note that if the file contains events inside group tags (see section 2.33 below), neve must refer to the number of event 



groups (plus the events which are outside the groups). 

■*For attributes which are not mandatory, (D=. . . ) indicates which value is assumed it not present 



Note that if the file contains events inside group tags (see section 2.33 below), maxweight, minweight and 



meanweight must refer to the weights of the groups (and the weights of the events which are outside the groups). 



8 



- m: the invariant mass of a particle of type p 1 . If additional particle types are specified the cut 
applies to the invariant mass of the corresponding number of particles, i.e. if pi, p2 and p3 
are specified the cut is on the invariant mass of any set of three matching particles. 

- pt: the transverse momentum of a particle matching pi. 

- eta: the pseudo-rapidity of a particle matching pi. 

- y: the true rapidity of a particle matching pi. 

- deltaR: the pseudo-rapidity-azimuthal-angle difference (^y~KTf~+~K(j?) between two parti- 
cles matching pi and p2 respectively. 

- E: the energy of a particle matching pi. 

- ETmiss: the norm of the vectorial sum of the pt of final state particles matching pi and not 
matching p2 (Note that an empty p2 defaults to the empty set here and for HT below). 

- HT: the scalar sum of the transverse momentum of final state particles matching pi and not 
matching p2. 

- limit (D=min): If set to min (max) only one number should be marked by the tag and give the 
minimum (maximum) for the kinematical variable, while if it is set to minmax, there should be 
two numbers corresponding to the minimum and maximum (in that order). 

The groups of particles to be considered in the pi and p2 attributes of the cut tag are specified by 
ptype tags, which simply contains the PDG codes of the particle types belonging to the group. The only 
allowed attribute in the ptype tag is 

- name (R): the name of this group of particle types. 

Here is a short example on how to specify a cut where a charged electron or muon is required to have a 
transverse momentum of at least 20 GeV and a minimum of 25 GeV missing transverse energy is required: 

<cutsinf o> 

<ptype name="l+-">ll -11 13 -13</ptype> 

<ptype name="nu">12 -12 14 -14 16 -16</ptype> 

<cut type="pt" pl="l+-">20</cut> 

<cut type="ETmiss" pl="0" p2="nu">25</cut> 
</cutsinf o> 



The procinfo tag (optional) The procinfo tag is used to supply additional per-process information 
in addition to what is given in the HEPRUP conomon block part of the init tag. The content of the tag is 
simply an arbitrary string describing the process. The attributes are the following: 

- iproc (D=0): The process number for which the information is given. This must correspond to the 
LPRUP code in the HEPRUP common block for the corresponding process. Also zero can be given, 
in which case it refers to all processes in the file (except those with a separate procinfo tag). 

- loops (D=0): The number of loops used in calculating this process. 

- qcdorder: The power of as used in calculating this process. 

- eworder: The power of the electro-weak coupling used in calculating this process. 

- r scheme (D=MSbar): The renormalization scheme used in calculating this process. 

- f scheme (D=MSbar): The factorization scheme used in calculating this process. 

- scheme (D=tree): Information about the scheme used to calculate the matrix elements to NLO. 
If absent, a pure tree-level calculation is assumed. Possible values could be CSdipole (NLO 
calculation with Catani-Seymour subtraction (9)), FKS pOpl) , MC@NLO p2p3) , POWHEG p4p3) 
and NLOexclusive (NLO calculation according to the exclusive cross section (see eg. |16|) 
within the given cuts). 
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The mergetype tag (optional) For some merging schemes (eg. for CKKW) it is possible to reweight 
the the events with Sudakov form factors already in the MEG. If this has been done the content of the 
mergetype tag for the corresponding process should give a name corresponding to the scheme used. 
The attributes are: 

- iproc: The process number for which the information is given. A zero means all processes except 
those with a separate mergeinf o tag. 

- mergingscale (R): The value of the merging scale in GeV. 

- maxmult (D=no): If yes, the corresponding process is reweighted as if it is the maximum 
multiplicity process, i.e. the Sudakov form factor associated with evolution between the smallest 
clustering scale and the merging scale is not included. 

2.32 PER-EVENT INFORMATION 

Information about a given event may be given with XML tags after the mandatory lines containing 
HEPEUP common block information (see figure[3]). 

The weight tag (optional) An event can be associated with a number of different weights given in 
weight tags. The content of these tags is simply a sequence of weights corresponding to the cross 
section for the event using different PDFs, as values, etc. which can be used to estimate the systematic 
errors due to, e.g., PDF uncertainties. Each weight tag should be given a name for identification. Only 
one we i ght tag per event can be without a name and should then only contain one weight, which is 
the one for which the statistics in the xsecinf o tag is given. The attributes of the weight tag are as 
follows 

- name: An arbitrary string describing this set of weights. If no name is given this is the main weight 
for the event. 

- born (D=l): If this is not a normal tree-level event but reweighted in some way (eg. by Sudakov 
reweighting or using loop contributions), this should be set to the relative weight of the tree-level 
cross section. 

- sudakov (D=l): If this event has been reweighted by a Sudakov form factor, the size of this factor 
should be given here. 

The last two attributes will probably only be given for the main weight. If an event has only been 
reweighted by a Sudakov form factor then these attributes are related by born*sudakov=l. The total 
Bom cross section is obtained by summing the weights multiplied by born for each event of the given 
process, and multiplying with totxsec/maxweight/neve from the xsecinf o tag. 

The clustering tag (optional) If an event has eg. been reweighted with Sudakov form factors, it is 
possible to specify how the current event has been clustered to find the scales involved. The contents 
of this this tag should be a series of clus tags. The clustering should be defined from the final state 
backwards in terms inverse time-like splittings, in the end defining a "bare" ladder diagram. This is then 
followed by a sequence of space-like splittings. 

The clustering tag contains a number of clus tags corresponding to each of the splittings. Each 
clus tag contains two or three integers. The first two numbers indicate which particles entries in the 
HEPEUP common block are clustered. If a third number is given it should correspond to an actual particle 
entry which corresponds to the combined object (if eg. a decayed resonance is explicitly present in the 
HEPEUP common block). If no third number is given, the clustered object is in the following referred to 
by the first number. The attributes of the clus tag are: 

- scale: The scale (in GeV) associated with the clustering. 

- alphas: If the event has been reweighted with an as at the scale of this clustering, the value of 
this as should be supplied here. 
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<clustering> 
<clus>6 7 5</clus> 
<clus>3 8</clus> 
<clus>l 3</clus> 
<clus>l 5</clus> 
<clus>l 4</clus> 
</clustering> 



Fig. 4: Example of how a Feynman diagram can be encoded using a clustering tag. The numbering in the 
diagram corresponds to the particle entries in the HEPEUP common block 



We also wish to draw attention to the fact that the clustering tag can equally be used to encode the 
Feynman diagram (or the most likely of the ones) used to produce the event. See figure|4]for an example. 

The pdfinfo tag (optional) The pdf inf o tag contains the values of the PDFs used when generating 
this event, given by two numbers, xfi{xi, Q^) and xf2{x2, Q"^), for the two incoming partons. The 
attributes are: 

- pi: The PDG code of the first incoming parton. 

- p2: The PDG code of the second incoming parton. 

- X 1 : The momentum fraction of the first incoming parton. 

- x2: The momentum fraction of the second incoming parton. 

- scale: (D=SCALUP) The scale in GeV used in the PDFs (the defauh is taken from the HEPEUP 
common block). 

2. 33 GRO UPING OF EVENTS 

If we have a NLO calculation using eg. Catani-Seymour subtraction of a process with particles in 
the Born level, each + 1 tree-level event will come with a number of counter events with N particles. 
For this reason there is a need to group events together. Such a group of events should be included in a 
group tag. 

The group tag (optional) The content of this tag is a number of event tags which for should be 
considered together. If this is a + 1 tree-level event with a number of A^-particle counter events, the first 
event must always be the N + 1-particle event even if this event fails the cuts. The rest of the events are 
then the counter events. The group tag must also contain at least one main weight tag (without name 
attribute) which is the one for which the statistics in the xsecinf o tag is given. The individual weights 
of the events in the group should sum up to the weight of the whole group. The only allowed attribute is 

- n (R): The number of events in the group. 

Note that if event groups are present, the neve attribute in the xsecinf o tag should count an event 
group as a single event. Also, it is the weight of the event group which relates to the maxweight 
and meanweight attributes in the xsecinf o tag. To be compatible with the previous standard, where 
the <event> and </event> tags are required to be alone on a single line, also the <group> and 
</group> tags are required to be alone on a single line. 
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2,4 OUTLOOK 

Event files which follows this new standard should have their version attribute in the LesHouches- 
Events (see figure[T]l tag set to 2 . 0. The web pagelhttp : / /home . thep . lu . se/~leif /LHEF/| 
contains a number of C++ classes implementing the reading and writing of files according to the new 
standard. 

We have tried to make the new standard backward compatible with the previous one, and although all 
existing parsers may not be able to read new files we propose to keep the old preferred . Ihe file name 
extension. 

As with the previous standard, the current proposal must not be viewed as the end of the road. There 
may be further information exchange that ought to be standardized. It is allowed to use/promote a "private 
standard" of tags in the header block or of additional event information, and experience with such could 
point the way towards an extended standard at a later date. 



Note that a formal description of the proposed new standard can be found at the Les Houches 09 wiki 
pages 



http : / /www . Ipthe . jussieu . f r/LesHouchesO 9Wiki/ images/ 6/50 / Grammar . pdf 
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3. A DRAFT RUNTIME INTERFACE TO COMBINE PARTON SHOWERS AND NEXT-TO- 
LEADING ORDER QCD PROGRAMS [] 

3.1 INTRODUCTION 



Parton shower simulation programs like Pythia 6 and Herwig 6 1 17 18 1 have been the workhorses for high 
energy physics experiments for a long time. With the advent of the Large Hadron Collider (LHC) at CERN, 
the FORTRAN generators have been completely rewritten and extended as Pythia 8 and Herwig++ 1 19 -21] 



and the program Sherpa has been established p2[ 23 1. 



Many new developments have been made in order to refine these simulations and to extend their 
applicability. Particularly, progress is now being made towards automation of schemes combining 
parton shower Monte Carlos and NLO QCD corrections consistently along the lines of now established 
schemes [12 14 1. Especially the POWHEG scheme exhibits close connections to methods correcting the 
hai^dest emission in parton shower simulations to the relevant exact real emission matrix element, |24 1. 
Dedicated codes performing just the matching to NLO QCD are as well under development, |25 1. 

In order to use existing and future tools for fixed-order calculations, especially in view of efforts towards 
automation present for these problems as well, it is desirable to specify an interface to the generic building 
blocks of fixed-order, particularly NLO QCD calculations carried out within the subtraction formalism. 

The approach of communicating simulation results between different programs taken so far by ex- 
changing event files of a definite format |{3j is not appropriate for the task being addressed here. Rather a 
runtime interface between two distinct codes is desirable. Here, one code links to another either statically 
or dynamically and makes use of the implemented functionality through an interface which is now a set of 
definite function calls. 

The interface should however not limit the particular ways in which fixed-order calculations are carried 
out, neither how the matching is actually performed by the parton shower Monte Carlo or a dedicated 
matching code. It should further not limit which programming language is actually being used and 
is therefore formulated in a language independent way. Any binding to a particular language is an 
implementational detail. 



3.2 PRELIMINARIES 

3. 21 OBJECTS HANDLED BY THE INTERFACE 

The interface aims at having each individual building block of a leading- or next-to-leading order QCD 
calculation at hand. In particular 

• phase space generation 

• tree-level matrix elements 

• subtraction terms 

• finite parts of one loop/Born interferences 

• finite remainder terms originating from coUinear factorization 

are addressed as individual objects. These objects will be defined in detail in the following sections. 

3. 22 DIFFERENTIAL CROSS SECTIONS 

The interface is formulated to handle all pieces inherent to a leading- or next-to-leading order differential 
cross section to be available in the following form, where a denotes any contribution as listed in the 

'Contributed by: S. Platzer 
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previous subsection: 



do-ij- = fi(.Xi,fJ-F)fj{Xj,fiF) X 

d(l){{pi,Pj,pi, ...,Pn},Xi,Xj 



df 



dV . (1) 



Here {i, j, ki, kn} identifies a particular subprocess of which the term indexed by a contributes 
to, {pi,pj,pi, ...,pn} is the physical phase space point, and r is a set of d random numbers in the 
d-dimensional unit hypercube. 

In the following the Jacobian determinant will be referred to as a phase space generator. The corre- 
sponding functionality will be addressed as an individual part of differential cross sections within the 
interface. Note that this does not reference the precise way in which the Monte Carlo integration or the 
generation of unweighted events is performed by any client cod^ 

The parton density functions (PDFs) / as well as the value of renormalization scale fiR, factorization 
scale and the strong coupling will be provided by the client code, such that the part of the interface 
representing the contribution a is completely specified as a function evaluating F". Additionally, few 
bookkeeping specifications will be introduced. 

For tree-level matrix elements (including the real-emission part of a NLO calculation, F just represents 
the corresponding tree-level matrix element squared times the appropriate flux and symmetry factors. 



3. 23 NLO DIFFERENTIAL CROSS SECTIONS AND SUBTRACTION 

The interface focuses on NLO QCD corrections being carried out within the subtraction formalism to 
obtain finite contributions from both virtual and real-emission corrections to any infrared safe observable. 

Examples of schemes widely in use are [9| [T0{[26| , the interface to be specified in detail in the next 
section should however not limit any implementation to a particular scheme. 

Few universal properties are however assumed to be common to any subtraction scheme at next-to- 
leading order. In particular it will be assumed that the auxiliary cross section introduced to subtract the 
divergent behaviour of the real emission contribution to NLO corrections for a 2 — )• n process is of the 
form 

k 

5^ da,",,(p„+i, i„+i)0(K(Pn+i), i;?) . (2) 

a=l 

Here, O denotes an infrared safe observable (sometimes also referred to as a jet defining function) and 
Pn{Pn+i) is a unique and invertible momentum mapping from a real emission to an underlying Born 
configuration, which is identified by a flavour mapping — )• z^. 

The number of individual subtraction terms k is not assumed to be fixed. Besides the unique association 
of a subtraction term to an underlying Bom configuration each subtraction term is expected to subtract 
of a particular set of one or more collineai" divergences, which at NLO are identified by just labelling 
the pair of partons becoming collinear. This information is crucial for a parton shower simulation to 
assign an additional parton to a unique emitter in a meaningful way thereby fixing the initial conditions 
for subsequent showering. 



3. 24 'COLLINEAR REMAINDERS ' 

After analytical integration of the subtraction terms, the encountered divergences cancel these present in 
the virtual correction and counter terms needed to renormalize the parton distribution functions. The finite 
remainder left from cancelling the latter divergences will then additionally depend on the factorization 
scale, the momentum fractions xi.2 and convolutions in one or two variables z G [0, 1] are in general 

^Throughout this work the NLO code itself is considered to be 'client' code of the interface. 
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present. Further, these convolutions are assumed to be casted in a form which is suitable to be done by 
Monte Carlo methods as well. The interface therefore assumes a modified version of the contribution F 
associated to these finite remainders, 

^{5-fcl,...,fc„} = ^{3!fci,...,fe„}(fe'Pj'^^l'-'?'"}'/^R'MF,ari,X2,2;i,2;2) , (3) 

where two additional random numbers zi,2 on the unit interval are provided such that the convolution is 
performed on averaging over all events. 

3.25 COLOUR DECOMPOSITION 

In order to determine parton shower initial conditions but as well for the purpose of implementing 
independent subtraction schemes, the knowledge of partial amplitudes for the Bom and real emission 
contribution is crucial. 

The basis being used in this colour decomposition is however not unique. To the extent that parton 
shower initial conditions are usually determined by colour flows in the large- limit, it would be desirable 
to have this decomposition in the fundamental representation of SU {N = 3) available. Here, the basis 
tensors are just strings of Kronecker d's, 

S^---S^, (4) 

where an upstairs index transforms according to the anti-fundamental, a downstairs index according to the 
fundamental representation, indicating outgoing colour (incoming anticolour) and outgoing anticolour 
(incoming colour), respectively. Note that this representation is not limited to, but well suited, for the 
large- limit. The interface assumes this representation being available, but may be extended to other 
representations leaving it up to the cUent code to change basis from one to another decomposition. 



3.26 NOTATION 

The interface is formulated as a set of functions, taking any number of arguments and returning a result. 
The result may be a composite object of several types, which, depending on the language binding, may 
individually be passed as references to the function call. 

A function of name F taking arguments of type A1 , A2,..., returning a result of type T is denoted by T 
F (A1 ,A2,...), where each argument may be followed by a name identifying the meaning of the argument. 
The required types are defined in the following section, except for obvious simple types. Semantic 
definitions of each interface part are accompanied by pre- and postconditions where needed. Examples of 
language bindings are given to explain the relation between the abstract notation and implementational 
details. 



3.27 TYPES USED 

The purpose of this section is to introduce the more complex types used to specify the interface components. 
Few basic types with obvious semantics are used without further documentation. 

vector - A Ust of objects of the same type. Elements in the list can be accessed randomly by an 
integer index. The type of objects stored is indicated in angle brackets, e.g. vector<double> is a vector 

storing doubles. The size of the vector is to be asserted in the context of a particular piece of the interface. 
Examples are INTEGER V(4) or std::vector<int> v(4); for FORTRAN and C-i-i-, respectively. 

pair - A pair stores to values of potentially different type. This is an auxiliary concept which does not 
have to be supported by a particular language. A pair storing values of type T1 and T2, respectively, is 
denoted pair<T1 ,T2>. 

union - A union generaUzes pair to more than two entries. 

processid - Defined to be vector<int>. Identifies a subprocess giving PDG ids for incoming partons 
in the first two entries, and PDG ids for outgoing partons in the following entries. The size of the vector is 
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guaranteed to be greater or equal to three. The subprocess gg ddg, for example, would be identified by 
the entries {21, 21, 1, -1, 21}. 

momentum - Defined to be vector<double> of length five. The first four entries contain the four- 
momentum components Px,Py,Pz, E in units of GeV. The fifth component is optional containing the 
invariant mass squared in units of GeV squared. The metric is agreed to be mostly-minus, p-q = 

EpEq - p- q. 

pspoint - Defined to be union<double,pair<double,double>,vector<momentum> >. Represents 
a phase space point as the phase space weight (Jacobian from unit-hypercube random numbers to phase 

space measure), momentum fractions of incoming partons and a set of momenta. The first two momentum 
entries specify the momenta of incoming partons, subsequent these of outgoing partons. The size of the 
momentum vector is greater or equal to three. The weight is given in units of the proper power of GeV to 
obtain a dimensionless quantity. 

COlourflow - Defined to be vector<pair<int,int> >. Represents a colour flow assigned to a particular 
subprocess in the following convention: in association to a processid, the first and second members of an 
entry of a colourflow at position i contain an integer id for the colour and anticolour line, the corresponding 
parton at position i in the process id is coimected to. By convention, a triplet always has its second 
colour index set to zero, an antitriplet its first index. A singlet uses the pair {0, 0}. Note that this notation 
is not limited to large- flows, but may represent any basis tensor in the fundamental representation 
by the following identification: Each non-zero entry in the first entry of a pair in a colour flow vector 
identifies an index transforming according to the anti-fundamental, each entry at the second position an 
index transforming according to the fundamental representation of SU (N) and entries of the same id are 
attached to a Kronecker-5. Thus, the suppressed singlet contribution to a gluon (cf. the Fierz identity 
for the fundamental representation generators) carries the pair {k,k}, where k is not zero. Example: 
Consider the process e+e~ — > ddg, which would be identified by the processid {—11, 11, 1, —1, 21}. 
Here two colour flows are possible, the leading part Sy^^^J'^, and the 1/N suppressed 5*^(5*^ contribution. 
The first one would be identified by the COlourflow {{0, 0}, {0, 0}, {ki, 0}, {0, ^2}, {k2, ki}}, the latter 
by {{0, 0}, {0, 0}, {ki,0}, {0, ki}, {k2,k2}}, where ki ^ k2 are non-zero, positive integers. 

3.3 SPECIFICATION OF THE INTERFACE 

Initialization and Bookkeeping 

boo I initialize () - The fixed-order code performs initiaMzation and reads any relevant parameters from 
its preferred input mechanism. It returns true on success and false on failure. 

pair<int,String> alphasinfo () - Return information on the running of the strong coupling to be used 
as a combination of the number of loops contributing to the QCD /3-function, and a string identifying the 
renormalization scheme used. Values for the latter have to be agreed on. 

bool haveleadingorder (processid) - Return true, if the process identified by the given processid can 
be calculated at leading order. 

vector<colourflow> colourflows (processid) - Return the possible colourflows which could be 

selected for the given processid. 

vector<colourflow> largencolourflows (processid) - Return the possible colourflows in the large- iV 
limit which could be selected for the given processid. 

bool haveoneloop (processid) - Return true, if one loop QCD corrections to the given process can be 
calculated. 

string havesubtraction (processid) - Return a non-empty string identifying a subtraction scheme, 
if real emission subtraction terms are available for the given process. This does not require that the real 
emission process itself can be calculated. Return an empty string, if subtraction terms are not present. 
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Other return values have to be agreed on. 

vector<processid> realemissions (processid) - Assuming the given processid identifies a Bom 
process, return the real emission processes to be considered for a NLO QCD correction. 

vector<int> subtractions (processid) - Assuming the given processid identifies a real emission 
process, return a list of ids for the subtraction tenns to be considered. The ids need to be unique to the 
fixed-order code and are not used except for identifying a subtraction term. For dynamically allocated 
vectors, the function may return an empty vector to indicate that the process considered is non-singular, 
for fixed-size vectors it should indicate this by filling the vector with zeroes. 

processid underlyingborn (processid, int) - Given a real emission process id and subtraction term 
id through the first and second arguments, respectively, return the underlying Born process the subtraction 
term maps to. 

vector<pair<int,int> > coUinearlimits (processid, int) - Given a real emission process id and 
subtraction term id through the first and second arguments, respectively, return the positions of the partons 
in the processid given, for which the identified subtraction term subtracts coUinear singularities. The 
ordering of the returned value entries is irrelevant and by convention fixed such that the first entry is less 
than the second. 



Kinematics and Phase Space 

int ndim (processid) - Return the number of random numbers needed to generate a phase space point 
for the given process. 

pspoint phasespace (pair<momentum,momentum>, vector<double>, processid) - Generate a 
phase space point given incoming particle's momenta, a list of random numbers e]0, 1[ and a process id. 



Dynamics 

Each function call defined here represents the associated contribution to differential cross section as 
defined in eq. [T] Results are assumed to be scaled by the appropriate power of GeV such as to return a 
dimensionless quantity. 

double me2 (processid, pspoint, double) - Return the helicity and colour summed matrix element 
squared for the given process, phase space point and value of the renormalization scale in GeV. 

double partialme2 (processid, pspoint, double, colourflow) - Return the hehcity summed partial 
amplitude squared, identified by the given colour flow, evaluated for the given process, phase space point 
and value of the renormalization scale in GeV. 

double bornvirt (processid, pspoint, double) - Return the helicity and colour summed Bom- 
virtual interference plus the integrated subtraction terms (i.e. the finite part remaining after carrying out 
subtraction), consistent with the subtraction scheme chosen for the real emissions. Arguments in order are 
the process to be considered, the phase space point and the renormalization scale in GeV. 

double collinear (processid, pspoint, double, double, pair<double,double>) - Return the finite 
collinear remainder contribution consistent with the subtraction scheme chosen for the process identified 
by the given processid and phase space point. Further arguments in order are renormalization and 
factorization scales in units of GeV, and two further random variables on the unit interval to perform 
Monte Carlo integration over convolutions present. 

double subtraction (processid, int, pspoint, double) - Return the subtraction term identified by real 
emission process and subtraction term id, given a phase space point and renormalization scale in units of 
GeV. 
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CONCLUSIONS 

A general runtime interface has been outlined to access the individual building blocks of a next-to-leading 
order (NLO) QCD calculation carried out within the subtraction formalism. 

Such an interface is an indispensable tool for 'cUent' programs dedicated to the matching of parton 

showers and higher order corrections. A subset of the interface may as well be used to implement so-called 
matrix element corrections within parton shower Monte Carlos, correcting the hardest shower emission to 
the exact real emission matrix element squared. 
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4. STATUS OF THE FLAVOUR LES HOUCHES ACC0RE|!] 
4.1 INTRODUCTION 

In addition to the increasing number of refined approaches in the literature for calculating flavour-related 
observables, advanced programs dedicated to the calculation of such quantities, e.g. Wilson coefficients, 
branching ratios, mixing amplitudes, renormalisation group equation (RGE) running including flavour 



effects have recently been developed p7] - |3T| . Flavour-related observables are also implemented by many 
other non-dedicated public codes to provide additional checks for the models under investigation [32] - |38) . 
The results are often subsequently used by other codes, e.g. as constraints on the parameter space of the 
model under consideration |l39]-(42l . 



At present, a small number of specialised interfaces exist between the various codes. Such tailor-made 
interfaces are not easily generalised and ai^e time-consuming to construct and test for each specific 
implementation. A universal interface would clearly be an advantage here. A similar problem arose some 
time ago in the context of Supersymmetry (SUSY). The solution took the form of the SUSY Les Houches 
Accord (SLHA) |43 44|, which is nowadays frequently used to exchange information between SUSY 
related codes, such as soft SUSY-breaking parameters, particle masses and mixings, branching ratios etc. 
The SLHA is a robust solution, allowing information to be exchanged between different codes via ASCII 
files. The detailed structure of these input and output files is described in Refs. ||43}|44|. 

The goal of this work is to exploit the existing organisational structure of the SLHA and use it to define 
an accord for the exchange of flavour related quantities, which we refer to as the "Flavour Les Houches 
Accord" (FLHA). In brief, the purpose of this Accord is thus to present a set of generic definitions for an 
input/output file structure which provides a universal framework for interfacing flavour-related programs. 
Furthermore, the standardised format will provide the users with a clear and well-structured result that 
could eventually be used for other purposes. 

The structure is set up in such a way that the SLHA and the FLHA can be used together or independently. 
Obviously, some of the SLHA entries, such as measured parameters in the Standard Model (SM) and the 
Cabibbo-Kobayashi-Maskawa (CKM) matrix elements are also needed for flavour observable calculations. 
Therefore, a FLHA file can indeed contain a SLHA block if necessary. Also, in order to avoid any 
confusion, the SLHA blocks are not modified or redefined in the FLHA. If a block needs to be extended 
to meet the requirements of flavour physics, a new "F" block is defined instead. 

Note that different codes may technically achieve the FLHA input/output in different ways. The details 
of how to 'switch on' the FLHA input/output for a particular program should be described in the manual 
of that program and are not covered here. For the SLHA, libraries have been developed to permit an 
easy implementation of the input/output routines |'45'|. In principle these programs could be extended to 
include the FLHA as well. 

It should be noted that, while the SLHA was developed especially for the case of SUSY, the FLHA is, 
at least in principle, model independent. While it is possible to indicate the model used in a specific block, 
the general structure for the information exchange can be applied to any model. 

This report summarizes the current status of the FLHA. Several issues are not defined in an unambigous 
way yet. This will be indicated in the text below. 



4.2 CONVENTIONS AND DEFINITIONS 

The structure of the Flavour Les Houches Accord input and output files is based on the existing SUSY 
Les Houches Accord structure and flavour quantities are defined in blocks. The general conventions for 
the blocks are very similar to the SLHA blocks ||43| and they are not reproduced here. 

Since a FLHA file can also contain SLHA blocks, to clearly identify the blocks of the FLHA, the first 

'Contributed by: F. Mahmoudi, S. Heinemeyer, A. Arbey, A. Bharucha, T. Goto, T. Hahn, U. Haisch, S. Kraml, M. Muhlleitner, 
J. Renter, P. Skands, P. Slavich 
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letter of the name of a block is an "F". There are two exceptions to this rule: blocks borrowed from the 
SLHA, which keep their original name, and blocks containing imaginary parts, which start with "IMF". 

The following general structure for the FLHA file is proposed: 

• BLOCK FC INFO: Information about the flavour code used to prepare the FLHA file. 

• BLOCK FMODSEL: Information about the underlying model used for the calculations. 

• BLOCK SMINPUTS: Measured values of SM parameters used for the calculations. 

• BLOCK VCKMIN: Input parameters of the CKM matrix in the Wolfenstein parameterisation. 

• BLOCK UPMNS IN: Input parameters of the PMNS neutrino mixing matrix in the PDG parameteri- 
sation. 

• BLOCK VCKM: Real part of the CKM matrix elements. 

• BLOCK IMVCKM: Imaginary part of the CKM matrix elements. 

• BLOCK UPMNS: Real part of the PMNS matrix elements. 

• BLOCK IMUPMNS: Imaginary part of the PMNS matrix elements. 

• BLOCK FMASS: Masses of quarks, mesons, hadrons, etc. 

• BLOCK FLIFE: Lifetime (in seconds) of flavour-related mesons, hadrons, etc. 

• BLOCK FCONST: Decay constants. 

• BLOCK FCONSTRATIO: Ratios of decay constants. 

• BLOCK FBAG: Bag parameters. 

• BLOCK FWCOEF: Real part of the Wilson coefficients. 

• BLOCK IMFWCOEF: Imaginary part of the Wilson coefficients. 

• BLOCK FOBS: Prediction of flavour observables. 

• BLOCK FOBSERR: Theory error on the prediction of flavour observables. 

• BLOCK F OB S SM: SM prediction for flavour observables. 

• BLOCK FFORM: Form factors. 



More details on several blocks are given in the following. The blocks SMINPUTS, VCKMIN, UPMNS IN, 
VCKM, IMVCKM, UPMNS, IMUPMNS are defined exactly as in the SLHA(2) and not further discussed 
here. 



BLOCK FCINFO 

Flavour code information, including the name and the version of the program: 

1 : Name of the flavour calculator 

2 : Version number of the flavour calculator 

Optional warning or error messages can also be specified: 

3 : If this entry is present, warning(s) were produced by the flavour calculator. 

The resulting file may still be used. The entry should contain a description 
of the problem (string). 

4 : If this entry is present, error(s) were produced by the flavour calcula- 

tor. The resulting file should not be used. The entry should contain a 
description of the problem (string). 
This block is purely informative, and is similar to BLOCK SP INFO in the SLHA. 



BLOCK MODS EL 

This block provides switches and options for the model selection. The SLHA2 BLOCK MODSEL is 
extended to allow more flexibihty. 
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1 : Choice of SUSY breaking model or indication of other model. By 
default, a minimal type of model will always be assumed. Possible 
values are: 
-1 : SM 

: General MSSM simulation 

1 : (m)SUGRA model 

2 : (m)GMSB model 

3 : (m)AMSB model 

4 : ... 

31 :THDM 

9 9 : other model. This choice requires a string given in the entry 9 9 



: (Default=0) Choice of particle content, only used for SUSY models. The 
defined switches are: 
:MSSM 



1 
2 



NMSSM 



4 : (Default=0) R-parity violation. Switches defined are: 

: R-parity conserved. This corresponds to the SLHAl. 

1 : R-parity violated. 



5 : (Default=0) CP violation. Switches defined are: 

: CP is conserved. No information on the CKM phase is used. 

1 : CP is violated, but only by the standard CKM phase. All other phases 

are assumed zero. 

2 : CP is violated. Completely general CP phases allowed. 



6 : (Default=0) Flavour violation. Switches defined are: 

: No flavour violation. 

1 : Quark flavour is violated. 

2 : Lepton flavour is violated. 

3 : Lepton and quark flavour is violated. 



31 : defines the type of THDM, is used only if entry 1 is given as 31, 
otherwise it is ignored. 

1 : type I 

2 : type IT 

3 : type m 

4 : type TV 

9 9 : a string that defines other models is used only if entry 1 is given as 9 9, 
otherwise it is ignored. 
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BLOCK FMASS 



The block BLOCK FMASS contains the mass spectrum for the involved particles. It is an addition to the 
SLHA BLOCK MASS which contained only pole masses and to the SLHA BLOCK SMINPUTS which 
contains quark masses. If a mass is given in two blocks the block FMASS overrules the other blocks. In 
FMASS we specify additional information concerning the renormalisation scheme as well as the scale at 
which the masses are given and thus allow for larger flexibility. The standard for each line in the block 
should correspond to the following FORTRAN format 

(Ix, 19, 3x, IP, E16 . 8, OP, 3x, 12, 3x, IP, E16 . 8, OP, 3x, ' #' , Ix, A) , 

where the first nine-digit integer should be the PDG code of a particle, followed by a double precision 
number for its mass. The next integer corresponds to the renormalisation scheme, and finally the last 
double precision number points to the energy scale (0 if not relevant). An additional comment can be 
given after #. 

The schemes are defined as follows: 




1 
2 
3 
4 
5 



pole 

MS 

DR 

IS 

kin 



BLOCK FLLFE 

The block BLOCK FLIFE contains the lifetimes of mesons and hadrons in seconds. The standard for 
each line in the block should correspond to the FORTRAN format 

(Ix, 19, 3x, IP, E16 . 8, OP, 3x, ' #' , Ix, A) , 

where the first nine-digit integer should be the PDG code of a particle and the double precision number its 
Ufetime. 

BLOCK F CON ST 

The block BLOCK FCONST contains the decay constants in GeV. The standard for each fine in the block 
should correspond to the FORTRAN format 

(Ix, 19, 3x, 12, 3x, 1P,E16. 8, OP, 3x, ' #' , Ix, A) , 

where the first nine-digit integer should be the PDG code of a particle, the second integer the number of 
the decay constant, and the double precision number its decay constant. 

BLOCK FCONSTRATTO 

The block BLOCK FCONSTRATTO contains the ratios of decay constants, which often have less uncer- 
tainty than the decay constants themselves. The ratios are specified by the two PDG codes in the form 
f(codel)/f(code2). The standard for each line in the block should correspond to the FORTRAN format 

(Ix, 19, 3x, 19, 3x, 12, 3x, 12, 3x, IP, E16 . 8, OP, 3x, ' #' , Ix, A) , 

where the two nine-digit integers should be the two PDG codes of particles, the third and fourth integers the 
numbers of the decay constants, which correspond to the second index of the entry in BLOCK FCONST, 
and the double precision number the ratio of the decay constants. 
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BLOCK FBAG 

The block BLOCK FBAG contains the bag parameters. The standard for each line in the block should 
correspond to the FORTRAN format 

(Ix, 19, 3x, 12, 3x, IP, E16 . 8, OP, 3x, ' #' , Ix, A) , 

where the first nine-digit integer should be the PDG code of a particle, the second integer the number of 
the bag parameter, and the double precision number its bag parameter. 

So far no normalisation etc. has been defined, which at this stage has to be taken care of by the user. An 
unambiguous definition will be given elsewhere. 

BLOCK FWCOEF Q= . . . 

The block BLOCK FWCOEF Q= ... contains the real part of the Wilson coefficients at the scale Q. 

The different orders C\ have to be given separately according to the following convention for the 
perturbative expansion: 

The couplings should therefore not be included in the Wilson coefficients. 

The entries in BLOCK FWCOEF should consist of two integers defining the fermion structure of the 
operator and the operator structure itself. These two numbers are not thought to give a full representation 
including normalisation etc. of the operator, but merely correspond to a unique identifier for any possible 
Wilson coefficient. Consequently, the user has to take care that a consistent normalisation including 
prefactors etc. is indeed fulfilled. As an example, for the operator Oi, 

Oi = (s7,.T'^PLc)(c7'^T"Pi6) (6) 

the definition of the two numbers is given as follows. The appearing fermions are encoded by a two-digit 
number originating from their PDG code, where no difference is made between particles and antiparticles, 
as given in Table [T] Correspondingly, the first integer number defining 0\, containing the fermions scc6, 
is given by 03040405. The various operators are defined in Table [2] Correspondingly, the second integer 
number defining 0\, containing the operators '^^T°-Pi ^^T"-Pl is given by 6161. 

A few more rules are needed for an unambigous definition. 

• If an operators appears without fermions (as it is possible, e.g., for F^y) it should appear right-most, 
so that the encoded fermions correspond to the left-most operators. 

• In the case of a possible ambiguity, for instance Oi = {s^^T"'PLc){cj^T"'PLb) corresponding to 
03040405 6161 and d = {c-/^T''PLb){s-f^'T''PLc) corresponding to 04050304 6161 the "smaller" 
number, i.e. in this case 03040405 6161 should be used. 

The third index corresponds to each term in Eq. ([5]): 



00 : 




01 : 




02 : 




10 : 




11 : 
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name 


PDG code 


two-digit number 


name 


PDG code 


two-digit number 


d 


1 


01 


e 


11 


11 


u 


2 


02 




12 


12 


s 


3 


03 




13 


13 


c 


4 


04 




14 


14 


b 


5 


05 


T 


15 


15 


t 


6 


06 


Vr 


16 


16 






07 


Ell 




17 


J2q iQq 




08 






18 



Table 1: PDG codes and two-digit number identifications of quarks and leptons. The summations are 
over active fermions. 



operator 


number 


operator 


number 


operator 


number 


1 


30 




50 




70 


Pl 


31 


PlT'' 


51 


PiSij 


71 


Pr 


32 


PrT" 


52 


PrSij 


72 




33 




53 




73 


75 


34 




54 




74 


at"" 


35 




55 




75 




36 




56 




76 




37 


7^*75^'^ 


57 




77 


I^Pl 


41 




61 


YSijPL 


81 


I^Pr 


42 




62 


l^'kjPR 


82 


(tI'^'Pl 


43 




63 


<7^"'SijPL 


83 


af^'PR 


44 


a^'^T'^PR 


64 


ai'^S^jPR 


84 


YYYPl 


45 


ji-iYYT"- Pl 


65 


YYY^rjPh 


85 


i'YYPR 


46 


-ft'Yl'''T''PR 


66 


l^Yl'^iiPR 


86 
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Table 2: Two-digit number definitions for the operators. (a = 1 ... 8) denote the SU (3)c generators, 
Pl,r = i(l =F 75), and {T"')ij{T°')ki = liSudkj - l/NcSij6ki), where A;, Z are colour indices. 

99 : total 

The information about the order is given by a two-digit number xy, where x indicates 0{a^) and y 
indicates 0{as), and indicates cl^\ 

The Wilson coefficients can be provided either via separate new physics and SM contributions, or as a 
total contribution of both new physics and SM, depending on the code generating them. To avoid any 
confusion, the fourth entry must specify whether the given Wilson coefficients correspond to the SM 
contributions, new physics contributions or to the sum of them, using the following definitions: 

: SM 

1 : NPM 

2 : SM-fNPM 

The new Physics model is the model specified in the BLOCK FMODSEL. 

The standard for each Une in the block should thus correspond to the FORTRAN format 
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(Ix, 18, Ix, 14, 3x, 12, 3x, II, 3x, IP, E16 . 8, OP, 3x, ' #' , Ix, A) , 



where the eight-digit integer specifies the fermion content, the four-digit integer the operator structure, 
the two-digit integer the order at which the Wilson coefficients are calculated followed by the one-digit 
integer specifying the model, and finally the double precision number gives the real part of the Wilson 
coefficient. 

Note that there can be several such blocks for different scales Q. 

BLOCK IMFWCOEF Q= . . . 

The block BLOCK IMFWCOEF contains the imaginary part of the Wilson coefficients at the scale Q. The 
structure is exactly the same as for the BLOCK FWCOEF. 

BLOCK FOBS 

The block BLOCK FOBS contains the flavour observables. The structure of this block is based on the 
decay table in SLHA format. The decay is defined by the PDG number of the parent, the type of the 
observable, the value of the observable, the number of daughters and PDG IDs of the daughters. 
The types of the observables are defined as follows: 

1 : Branching ratio 

2 : Ratio of the branching ratio to the SM value 

3 : Asymmetry - CP 

4 : Asymmetry - isospin 

5 : Asymmetry - forward-backward 

6 : Asymmetry - lepton-flavour 

7 : Mixing 

8 : ... 

The standard for each line in the block should correspond to the FORTRAN format 

(Ix, 19, 3x, 12, 3x, 1P,E16. 8, OP, 3x, II, 3x, 19, 3x, 19, 3x, . . . , 3x, ' #' , Ix, A) , 

where the first nine-digit integer should be the PDG code of the parent decaying particle, the second 
integer the type of the observable, the double precision number the value of the observable, the next 
integer the number of daughters, and the following nine-digit integers the PDG codes of the daughters. It 
is strongly advised to give the descriptive name of the observable as comment. 

BLOCK FOBSERR 

The block BLOCK FOBSERR contains the theoretical error for flavour observables, with the structure 
similar to BLOCK FOBS, where the double precision number for the value of the observable is replaced 
by two double precision numbers for the minus and plus uncertainties. 

In a similar way, for every block, a corresponding error block with the name BLOCK FnameERR can 
be defined. 

BLOCK FOBSSM 

The block BLOCK FOBSSM contains the SM values of the flavour observables in the same format as in 
BLOCK FOBS. The given SM values may be very helpful as a comparison reference. 
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BLOCK FFORM 

The block BLOCK FFORM contains the form factors for a specific decay. This decay should be defined as 
in BLOCK FOBS, but replacing the type of the observable by the number of the form factor. It is essential 
here to describe the variable in the comment area. The dependence on can be specified as a comment. 
A more unambiguous definition will be given elsewhere. 

4.3 CONCLUSION 

The interplay of collider and flavour physics is entering a new era with the start-up of the LHC. In the 
future more and more programs will be interfaced in order to exploit maximal information from both 
collider and flavour data. Towards this end, an accord will play a crucial role. The accord presented 
specifies a unique set of conventions in ASCII file format for most commonly investigated flavour-related 
observables and provides a universal framework for interfacing different programs. 

The number of flavour related codes is growing constantly, while the connection between results from 
flavour physics and high pT physics becomes more relevant to disentangle the underlying physics model. 
Using the lessons learnt from the SLHA, we hope the FLHA will prove useful for studies related to flavour 
physics. It is planned to update/correct the FLHA after more experience with its application will have 
been gathered. 
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Part II 

TUNING 



5. STATUS OF RIVET AND PROFESSOR MC VALIDATION & TUNING TOOLS B 

The Rivet [46 1 package for MC generator validation and the Professor 1 47 1 system for generator tuning have 
become estabhshed tools for systematically verifying event simulations and optimising their parameters, 
where required and physically sensible. In this short report, we summarise the status and development of 
these tools. 



5.1 Rivet 

Rivet is an MC validation tool: it encodes MC equivalents of an increasingly comprehensive set of HEP 
collider analyses which are useful for testing the physics of MC generators. Rivet does not itself produce 
tunings, but provides a standard set of analyses by which to verify the accuracy of a given generator with 
a given tuning. 

Several fundamental design principles have been derived from the experience on Rivet's predecessor 
system, HZTool |j48jj49|, and from iteration of the Rivet design: 

• No generator steering: Rivet relies entirely on being provided, by unspecified means, with events 
represented by the HepMC pO| event record. 

• No generator-specific analyses: all Rivet analyses are specifically not allowed to use the generator- 
specific portions of the supplied event records. Apart from a few limited (and deprecated) exceptions, 
all analyses are based solely on physical observables, i.e. those constructed from stable particles 
(those with status 1) and physical decayed particles (those with status 2). 

• Rivet can be used either as a C++ library to be interfaced with generator author or experiment 
analysis frameworks, or as a command line tool (which itself makes use of the library interface). 
This is an example of the general philosophy to keep things simple and flexible, since we do not a 
priori know every task to which our tools will be employed. 

Internally, Rivet analyses are based on a comprehensive set of calculational tools called projections, 



which perform standard computations such as jet algorithms (using FastJet |51 1), event shape tensors, and 
a variety of other standard tasks. Use of projections makes analysis code much simpler, encapsulates 
any complexities arising from the ban on use of event record internal entities (the summation of photon 



momenta around charged leptons during Z-finding is a good example, see Section 14. i, and is more 
efficient than just using library functions, due to a complex (but hidden) system of automatic result 
caching. 

Users can write their own analyses using the Rivet components and use them via the Rivet API or 
command-line tool without re-compiling Rivet, due to use of an analysis "plugin" system. Separation 
between generator and Rivet on the command-line is most simply achieved by using the HepMC plain text 
lO-GenEvent format via a UNIX pipe (a.k.a. FIFO): this avoids disk access and writing of large files, 
and the CPU penalty in converting event objects to and from a text stream is in many cases outweighed by 
the general-purpose convenience. For generator-specific use of Rivet, the programmatic interface allows 
HepMC objects to be passed directly in code, without this computational detour. A sister tool, AGILe |52J, 
is provided for convenience control of several Fortran-based generators, with command-line and parameter 
file based run-time steering of generator parameters. This is a convenient tool when exploring generator 
parameter space as part of a tuning. 

Reference data for the standard analyses is included in the Rivet package as a set of XML files in the 
AIDA format. After several years of re-development as part of the CEDAR [5 2 J project, the HepData 



'"Contributed by: A. Buckley, J. M. Butterworth, H. Hoeth, H. Lacker, J. Monk, H. Schulz, J. E. von Seggem, F. Siegert 
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database of HEP experimental results can be used to directly export data files usable by Rivet from its 
Web interface at http : //hepdata . cedar . ac . uk;/| Analysis histograms are directly booked using 



the reference data as a binning template, ensuring that data and MC histograms are always maximally 
consistent. 

Rivet is in use within the MC generator development community, particularly in general-purpose shower 
MC programs, and the LHC experimental community, for MC validation and MC analysis studies which 
do not require detector simulation. 



5.11 Recent developments 

Rivet 1. 1.3 was released during the first week of this Les Houches workshop, in June 2009: this release 
includes many new analyses and fixes to existing analyses. Since the workshop, a huge number of extra 
improvements and developments have taken place in the run-up to the 1.2.0 release. Aside from many 
technical improvements, and the addition of a large number of QCD analyses (primarily for minimum 
bias and multi-jet physics) the major conceptual developments have been an emphasis on automated 
testing and validation of Rivet code, and the removal of hard-coded cross-section normalisations whenever 
possible. This latter step required more development than may be expected, due to the separation of 
generator and analysis: the HepMC record had to be enhanced to store cross-section information in a 
way which can be passed to Rivet. This has now been done, and recent versions of major generators such 
as Herwig++ |[T9j|21], Sherpa [22,23], and Pythia 8 [20|, support this HepMC feature "out of the box". 
AGILe's generator interfaces have also been updated to write cross-section information into their HepMC 
output. Determining scaling i^-factors where required is now performed via post-processing scripts, 
which automatically support common approaches to constraining this remaining degree of freedom. 

A technical development, but one worth mentioning, has been the emphasis on making Rivet analyses 
"self-documenting": each analysis has a structured set of metadata specifying name, authors, run conditions, 
a description, etc., which is used to provide interactive help, HTML documentation, and a reference 
section in the Rivet manual. 

At the time of writing, the final stage of systematic validation of Rivet for the 1.2.0 release is underway. 
The validation scripts used for this checking will henceforth be included in automatic build tests, to ensure 
that future developments do not unexpectedly change existing analysis functionality. The final major 
stage of development is the upgrade of Rivet's histogramming and data analysis code, which is currently 
rather basic. The upgrade will enable statistically accurate combination of runs, allowing for greater 
parallelisation of Rivet analyses which require large event statistics. 



5.2 Professor 

The Professor system builds on the output of MC validation analyses such as those in Rivet, by optimising 
generator parameters to achieve the best possible fit to reference data. The main description of Professor's 
details is found in reference | [47| , and we will not significantly replicate it here, except in the most 
high-level sense. 

Fundamentally, generator tuning is an example of the more general problem of optimising a very 
expensive function with many parameters: the volume of the space grows exponentially with the number 
of parameters and the CPU requirements of even a single evaluation of the function means that any attempt 
to scan the parameter space will fail for more than a few parameters. Here, the expensive function is 
running a generator with a particular parameter set to recreate a wide range of analysis observables, using 
a package such as Rivet. The approach adopted by Professor is to parameterise the expensive function 
based on a non-exhaustive scan of the space: it is therefore an approximate method, but its accuracy is 
systematically verifiable and it is currently the best approach that we have. 

The parameterisation is generated by independently fitting a function to each of the observable bin 
values, approximating how they vary in response to changes in the parameter vector. One approach to 
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fitting the functions would be to make each function a linear combination of algebraic tenns with n 
coefficients Ui, then to sample n points in the parameter space. A matrix inversion would then fix the 
values of Oj. However, use of a pseudoinverse for rectangular matrices allows a more robust coefficient 
definition with many more samples than are required, with an automatic least-squares fit to each of the 
sampled "anchor points": this is the method used by Professor. By aggregating the parameterisations of 
all the observable bins under a weighted goodness of fit measure - usually a heuristic x^- a numerical 
optimisation can be used to create an "optimal" tune. In practice, many different semi-independent 
combinations of MC runs are used to provide a systematic handle on the degree of variation expected in 
tunes as a result of the inputs, avoiding the problem that a single "maximum-information" tune may not 
be typical of the parameter space. 

The first application of Professor, due to its popularity and fairly well-understood steering parameters. 



was the Pythia 6 MC generator p8| . This was tuned in reference | [47| , using both of the available parton 
shower and multi-parton interaction (MPI) models, to data from LEP, SLD, and Tevatron Runs I and II. It 
was found that the parameterisation method worked well in all cases, and a range of systematic methods 
and tools were developed to check the accuracy of the approximations, such as line-scans through the 
parameter space. It was found that a sensible maximum number of parameters to be included in a single 
tune was ~ 10, and hence, there being ~ 20 Pythia 6 parameters relevant to the studied observables, 
we separated the tune into an initial stage of final state shower and fragmentation tuning using e+e~ 
observables, and then a second stage based on tuning initial state parton shower and MPI parameters to 
best describe hadron collider data. In these tunes, a quadratic parameterisation was used throughout, this 
being the simplest suitable function to account for parameter correlations. 

5.21 Recent developments 

The main development since the initial publication and use of Professor has been the application to more 
MC tunings. A first extra application was to use the same weightings as for the Pythia 6 tune to obtain 
additional tunings of Pythia 6 with different PDFs. The results of this study, shown at the PDF4LHC 
meeting in April 2009, indicated that modified leading order PDFs, developed for use with LO MC 
generators and characterised by a larger than normal low-x gluon component, drive statistical generator 
tunings in a physically expected direction: the main effect was to increase the screening of MPI effects 
proportional to the size of the gluon PDF at low x values. 

Moving away from Pythia 6, substantial tuning effort has been expended with the Jimmy |54 55) MPI 



simulation (used with the Fortran Herwig 6 code [ 17 1), and the Pythia 8 and Sherpa generators. In the case 
of Pythia 8, the default fragmentation settings are now those fixed by Professor, and use of Professor has 
identified a problem with description of underlying event (UE) observables in QCD events: this problem 
is being addressed. An extremely useful tool in this study was the prof -I GUI, which makes use of the 
Professor parameterisations not for minimisation, but for interactive mimicking of generator responses to 
parameter changes: this tool makes exploration of speculative tuning ideas easily testable, and helped to 
verify that no combination of parameters would achieve the desired effect in Pythia 8's description of UE 
observables. 

Another major effort has been the use of Professor to tune and develop the Sherpa generator's simulation 
of hadronisation and soft initial-state QCD physics. This collaboration has helped to rapidly iterate model 
improvements and debugging, due to the fast turnaround of tune information. 

Professor is additionally being used within the ATLAS, CMS, and LHCb LHC experimental collabora- 
tions for tuning studies of the main generators used for their MC simulation, and in plans for re-tuning to 
first LHC data at new centre of mass energies. 

Development of the Professor framework and application to different implications of tuning continues: 
the next contribution details how ensembles of tunes created by Professor may be used for estimations of 
tune uncertainties in MC predictions. Other suggested extensions to optimisation of observable definitions 
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or parameterisation of other expensive functions, e.g. observables in SUSY parameter space, remain open 
to exploration. 
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6. QUANTITATIVE ERROR ESTIMATION IN MC TUNES 

Recent developments in Monte Carlo generator tuning have led to more robust and general-purpose 
"optimal" tunes to existing data, and there is a clear hope that when existing data is well-described, 
extrapolations to future collider energies will also be reliable. However, especially in the area of soft QCD, 
generator predictions are never expected to be exactly accurate: there is no single "best" tune for a given 
model, but rather one or more regions of parameter space which contains reasonable tunes. The size of 
these regions reflects the degree of constraint which existing data is able to place on the model - discrete 
choices of model are also crucial to obtain a true sense of the total uncertainty on any generator prediction. 
In this contribution we present studies of how the mechanisms used for systematic generator tuning can be 
used to quantitively estimate the contributions to the uncertainty of MC predictions from several sources. 

A key example of generator uncertainty is the extrapolation of minimum bias and underlying event 
QCD physics to LHC design energies, i.e. ^/s » 2 TeV. The physics that drives the rise in activity is a 
combination of the non-perturbative total pp cross-section; the non-perturbative physics of beam remnants, 
diffraction, and multiple partonic scattering; and perturbative low-p^ QCD scattering. Accordingly, the 
most-used models are highly phenomenological and have many tweakable parameters: one interesting 
consequence of first LHC data in the multi-TeV energy regime will be the testing of whether these models 
extrapolate in agreement with nature. The expectation is that some models will fail the test! 

Previous studies of prediction uncertainty have been necessarily qualitative and subjective, since MC 
tunes have themselves been somewhat approximate affairs. The most common approach to assigning a 
systematic uncertainty has been to compare predictions from two different models, such as Pythia and 
Phojet in the case of Minimum Bias/Underlying Event extrapolation. Discrete choices of model remain 
a major source of uncertainty, even when the range of historic Pythia tunes with a Tevatron-excluded 
energy extrapolation is excludecj^- we encourage all extrapolated studies to make use of as many distinct 
(non-excluded) models as possible. In this contribution, however, we describe how to quantitively assess 
major sources of uncertainty arising from the tuning process itself, i.e. the reasonable scatter to be 
expected around a tune for a discrete model. Our baseline for this approach is the Professor tuning system, 
which we now summarise. 

6.1 MC tuning with Professor 

The "Professor" approach to MC tuning constitutes both a numerical method and a suite of tools which 
implement it. Fundamentally, Professor attempts to parameterise expensive functions - the bin values 
in a set of MC observables - by least-squares fitting of the parameterisation coefficients. The least- 
squares minimisation is made more approachable by use of the pseudoinverse method, implemented via 
a matrix singular value decomposition. Armed with a fast analytic model of how every bin of a large 
set of observables will respond to variations of the generator parameters, numerical optimisation of the 
generator's fit to reference data may be efficiently computed. A detailed description may be found in 
reference [47] . 

The benefit of this approach is clear for more than two parameters: Professor requires as input the values 
of observables for a moderately large number of MC runs distributed suitably in the generator parameter 
space, each point in the space perhaps requiring 0(48) CPU hours to complete. A serial optimisation 
approach such as Markov chain sampling would hence require thousands of CPU days to have a chance 
of converging, if the generator itself is not batch-parallelised. Professor, given sufficiently large batch 
computing resources, can trivially parallelise the generation of the input MC points for any generator and 
thereafter complete the parameterisation and fit optimisation in negligible time, allowing for scaling to 
higher numbers of tune parameters than could be attempted by methods which require iteration of the 

"Contributed by: A. Buckley, H. Hoeth, H. Lacker, H. Schulz, J. E. von Seggern 

'^This includes the default Pythia tune, the ATLAS MC08 tune, and (by construction) the Tune *T series from Rick Field. 
Essentially, any setup with PARP(90) < 0.2 is in contradiction with Tevatron data. 
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Fig. 5: A scatter plot of vs. parameter value for a set of Professor run combinations on three Pythia6 
parameters. Implicitly, this projection of tune parameter vectors on to parameter axes gives a qualitative 
measure of whether or not a parameter is well-constrained: these parameters become increasingly ill- 
defined from left to right. The different markers represent different degrees of oversampling, with the star 
representing the maximum information run — the points are for the same run combinations in all three 
scatter plots. 



time-limiting step. 

6.2 Qualitative error estimation in Professor 

An important feature of the Professor method is that it has always allowed for qualitative assessment of the 

tune robustness. A per-bin parameterisation in p parameters will require a minimum number of MC runs, 

(p) 

N^^^, for the least-squares pseudoinversion to be performed. For robustness it is advisable to oversample 
this minimum requirement by a factor 0{3) (or more, especially for large p) such that a tune will in fact 
use » ^min i^put ruus. Additionally, since the parameterisation and optimisation steps are fast, we 
take the opportunity to make many such overconstrained tunes by in fact sampling an even greater number 
of runs, ATsampied 3> N. We can then randomly sample a large number of mostly-independent A^-run tunes 
to obtain an ensemble of reasonable tunes - again, this step can be trivially batch-parallelised. The spread 
of this tune ensemble as projected on each parameter has been used in several Professor MC tunes as a 
heuristic for determining whether a parameter is well or poorly constrained, for detecting parameterisation 
problems, and for ensuring that the "maximum information" tune is typical of the ensemble. Several other 
checking methods, such as eigenvector Une scans, are also used to ensure that the details of the tune, and 
particularly the generator parameterisation, are reliable. 

To make a prediction of an observable for which there is no reference data (this may be an energy 
extrapolation, or simply an unmeasured feature at existing energies), the simplest approach is of course to 
run the generator with the obtained tune(s) and compute the observable. Using the many tunes resulting 
from the different run combinations will give a spread in the observable prediction, reflecting one part of 
tune uncertainty. In practice, since we can build a fast pai^ameterisation of the generator behaviour (in 
fact many of them) on the unfitted observable as part of the main Professor tuning process, this offers a 
much faster turnaround than processing another large (perhaps very large) set of generator runs - with the 
proviso that the parameterisations are of course non-exact. 

6.3 Sources of tune uncertainty 

Before making this method more quantitive, we now consider the various sources of uncertainty in the 
procedure outlined so far. This will help us to understand which sources of uncertainty are computationally 
controllable and which will have to, for now, remain more nebulous. These main sources of tuning error 
are as follows: 

1 . Error on experimental reference data. 
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2. Statistical error on the MC at the anchor points from which the parameterisation is constructed. 

3. Systematic limitations of the parameterising function to describe bin responses to parameter 
variations - pathological MC parameters with discontinuous or critical behaviour are particularly 
hard to generically parameterise, since a Jacobian transformation to a suitable meta-parameter is 
not always easily available. 

4. Choice of run combinations to make the parameterisation. 

5. Goodness of fit definition, including both the type of GoF measure and the choice and relative 
weighting of different data. 

6. Reasonable minimiser scatter within the valley containing the optimal tune point for a given 
parameterisation. Note that this cannot be completely disentangled from the role that error sources 
1, 2 and 3 play in defining the ^ valley. 

7. Limitations of the parameterisation(s) used to compute the parameterised MC value in extrapo- 
lated/unfitted observables. Of course, this error doesn't exist if the less convenient strategy of 
re-running the MC generator is used. 

8. For completeness, we again highlight the systematic error associated with the discrete physics model 
being tuned: the total error is far from complete without considering more than one viable model. 
Within a given model there are also systematic uncertainties, some of which may be quantified, e.g. 
cross-section integration uncertainty and PDFs: the second of these is particularly quantifiable due 
to the existence of error or replica PDF sets, themselves expressing reasonable variations in PDF 
fitting. 

Note that, for example, these sources of uncertainty such as the variation between members of the 
ensemble of run combinations are not unique errors introduced by the Professor approach: failing to test 
different run combinations does not eliminate the error associated with the choice of anchor runs used! A 
similar, but unquantifiable, error exists for any form of manual tuning. 

6.4 Construction of tune uncertainty confidence belts 

Our approach to quantifying the uncertainties from (combinations of) the sources listed in the previous 
section is to construct central confidence belts for observable bins from the various ensembles of tune 
results, parameterisations, etc. we have described. Explicitly, given a large number of reasonable and 
equivalent predictions for an observable bin value, we construct a band of given P-value as being the 
region containing fraction P of predictions, with equal fractions above and below. 

There are many ways to construct such ensembles - for the purposes of this study we identify three: 

Combination error: The ensemble from which we construct the confidence belt is simply the ensemble 
of predictions from different run combinations. This will hence represent the variation due to error 
sources 1, 2, and 4 - the other sources of uncertainty exist, but are not quantified by this approaclj'^This 
approach requires that correlations between different run combinations are small, which is ensured by the 
-^sampled ^ ^ requirement.If parameterisation (as opposed to explicit MC runs) is used for the translation 
of this tune ensemble into bin value predictions, then source 7 also applies. This can be included into the 
band construction by using many parameterisations, again constructed from run combinations. Different 
parameterisations should be used for minimisation and prediction to avoid reinforcing parameterisation 
systematics. 

Statistical error: The obvious failure of the combination error approach is that error source 6 - the 
measure of reasonable tune variation within the valley - is left unquantified. Hence it will be no 

'^MC error is currently not explicitly propagated into Professor's fit measure, due to stability problems: it enters implicitly via 
the statistical scatter of MC samples. This is being remedied. 
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Fig. 6: Two dimensional illustration of the parameter point 
sampling used for the statistical uncertainty estimate. We 
exploit the covariance matrix returned by the minimiser for 
a Gaussian sampling from the corresponding p-dimensional 
hyper-ellipsoid. The fjj (i = 1 . . . p) are the eigenvalues ob- 
tained from an eigen-decomposition of the covariance matrix. 



surprise that our "statistical" error band is constructed explicitly from an ensemble of samples from this 
valley. This is obtained in the simplest case by only using the maximum information Professor tune - 
that which is constructed from all the available MC runs. The covariance matrix of the parameters in the 
vicinity of the tune point is obtained - in principle directly from the parameterisation, in practice from the 
minimiser - and used to define a rotated hyper-Gaussian probability distribution in the parameter space. 
Sampling parameter points from this distribution gives us another ensemble of tune points and, as for the 
combination error, they can be mapped into observable predictions either by direct MC runs or by using 
one or many constructed parameterisations. In practice, we take advantage of the combined potential 
of the hyper-Gaussian sampling and MC parameterisation to build a confidence belt from 0(10, 000) 
samples. The quantified sources of error are hence 1, 2, 6, and 7. 

These approaches to error band building will be used in the next section to construct sample error bands 
for underlying event predictions. Although not currently implemented in Professor, we also highlight the 
most complete form of quantifiable error band within the Professor approach: 



Combined error: This extension is an obvious fusion of the above two ensemble/band constructions: 
as in the combination error, we construct an ensemble of points from run combinations, then for each 
run-combination point we construct a statistical ensemble. The combined ensemble of tune ensembles, 
and a variety of parameterisations to transform them into predictions will lead to error bands quantifying 
error sources 1, 2, 4, 6, and 7. 

The remaining error sources are 3, 5 and 8: limitations of the parameterisation, the observable 
weights/goodness of fit definition, and - most importantly - the uncertainty due to different physics 
models. These missing systematics remain qualitative in this scheme, and reliable MC predictions should 
take care to include estimates of their influence, albeit in a more ad hoc fashion than for the more 
statistically-induced errors. 



6.5 Results 

We now briefly present results using the first two confidence belt definitions presented above - ongoing 
work is addressing the "combined" belt definition and inclusion of PDF uncertainties. 



Our exploration is based on tunes of the Jimmy |54 55 1 MC generator, which simulates multiple 
parton interactions (MPI) for Herwig 6 [17], because it has only two relevant parameters, PTJIM and 
JMRAD(73|^- the frugality with parameters makes Jimmy an ideal "toy model" testbed, while remaining 



phenomenologically relevant. As a Jimmy-like MPI model is ruled out by Tevatron data 1 56 1, we fix a 



dependence of PTJIM on the centre of mass energy with the same ansatz as used in Pythia 6 1 18 1 



0.274 



PTJIM = PTJIMisoo • , (7) 

I 1800 GeV,' 



'"'We treat the inverse radius-squared of the protons, JMRAD(73), to be identical to that of the anti-protons, JMRAD(91), and 
ignore the interplay with ISR parameters in Herwig 6 itself. 
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(b) Statistical uncertainty, data and pseudodata 
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(d) Combination uncertainty, data and pseudodata 



Fig. 7: "Statistical" and "combination" error bands for transverse N^h flow at 7 TeV before and after 
adding IM events of pseudodata of this observable (black markers) to the tuning. The error bands are 
calculated from the central 95 (68)% of the binvalues of an ensemble of 10000 histograms each. 



where PTJIM 



1800 



is the value of PTJIM at the reference scale 



= 1800 GeV and is the parameter 
actually used in the tuning process. Furthermore, we use the MRST LO* PDF set |57| and use Tevatron 
data from CDF |j58j-|60J and D0 (61) as a tuning reference. A more complete tune would include the 
exponent of the p ™" energy dependence, but for toy-study purposes we here fix it to a value consistent 
with known energy extrapolation fits |47 62 1. 

In Figures 7(a) and 7(c)[ the statistical and combination error band definitions are shown for a 7 TeV 
underlying event observable - A'^^h flow transverse to the leading track (track with the largest transverse 
momentum in an event), as a function of leading track p^- computed in Rivet | |46| |, based on the fit 
to Tevatron reference data. The error due to variation of run combinations (error source 4) is notably 
somewhat larger than the scatter of points in the valley (error source 6), indicating that in the Jimmy 
model the parameters have a strong influence on this observable, and are hence well-constrained. In 
Figures 7(b) and 7(d) similar band constructions are shown, but in this case the same Rivet analysis has 
been used to simulate the effect of adding IM events of UE data at 7 TeV into the fit: the size of both 
error bands is reduced, as expected. 



6.51 Effect of extrapolation 

Finally, we consider systematically how error bands constructed in this way behave as extrapolations are 
taken further from the region of constraining data. In this case, since the computational requirements are 
significant, we only consider the "statistical" error band. 

We use the transverse region Nch density UE observable, evaluated at a range of ten centre of mass 
energies, ^/s^, between 200 GeVand 14 TeV. We then apply the following procedure, 
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(b) A'ch vs. Px"''' mean of plateau vs. 



Fig. 8: Transverse region charged particle density, (a) the typical plateau observed and (b) mean-height 
of that plateau as a function of ^/s. 
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Table 3: Definition of plateau regions (pY'^'^) used in the extrapolation study. 



Construct the maximum-information parameterisation of the generator response for the A'ch UE 
observable, f{-\fs^) 



Produce an ensemble of 10000 histograms, Hi, of the observable shown in Figure 8(a) (blue line) 
using the corresponding f{y/si) and points sampled using the procedure illustrated in Figure [6] 
Calculate the mean height of the Nch plateau. Mi, for each of the Hi 
Construct a 95% central confidence belt, CL{^/Sj) from the M, 



In Figure 8(b) the CL(-y/sJ are drawn. We observe a very tight confidence belt for the energy region of 
the Tevatron experiments, while the confidence belt becomes wider for extrapolation to LHC energies. 
The definitions of the plateau regions used can be found in Table [3] 

It is notable that these bands are narrow - sufficiently so that they have been visually inflated by a factor 
of 10 in the figure. While this reflects good stability in the tuning system, it is probably an underestimate 
of the true model uncertainty. A more complete study will include the exponent of energy extrapolation in 
equation (|7]) in the tune, since this may be a dominant effect in the errors for this particular observable 
and its inclusion will give more freedom for various features of the model to balance against each other: 
the toy tuning of the energy-dependent Jimmy model shown here is probably too restrictive to accurately 
represent the full range of variation allowed by the model, but serves as an indication of the extent to 
which such studies can be systematised. 



6.6 Conclusions 

We have catalogued a set of sources of uncertainty which either explicitly or implicitly contribute to 
any tune of a MC event generator, and presented a systematic approach to quantifying many of these 
uncertainties using the fast MC parameterisations and natural tune ensembles which arise from the 
Professor tuning approach. Example results have been shown, which exhibit some expected behaviours, 
such as the shrinking of error bands on adding new reference data in new areas of parameter space and the 
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blow-up of error bands as predictions venture further into unconstrained regions. 

Several things should be emphasised: first and most important is that this approach does not catch 
all sources of error. We have presented results from two definitions and have proposed a third, more 
comprehensive measure, but still variations such as PDF errors and discrete model variations need to be 
included. However, with the increased usage of systematic tuning methods, variations between models in 
UE observables are not as substantial as once they were - statistical errors are a non-negligible factor in 
assessing the reliability of phenomenologically-based MC predictions. 

The approach taken here has many obvious parallels in the world of PDF errors, with our approach 
having a good deal of overlap with the MC replica set approach of the NNPDF collaboration |63 1 as 
contrasted with the eigenset approach of the CTEQ and MRST/MSTW collaborations. While replica 
sets have the advantage of a more direct statistical uncertainty interpretation (although we do not have 
the option of the parameterisation-freedom exibited by the NNPDF use of neural networks), there is the 
pragmatic issue that 0(10) representative error tunes would be more usable than 0(10000) equivalent 
tunes. Whether such a concept can be statistically constructed remains to be seen - for now the Perugia- 
soft/hard tunes remain the obvious tool of choice. 
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7. MATRIX ELEMENT CORRECTIONS AND PARTON SHOWER MATCHING IN INCLU- 
SIVE Z PRODUCTION AT LHClE] 

7.1 INTRODUCTION 



In this paper we compare Pythia 6 |18], AlpGen |64| and Sherpa |23 65 1 in inclusive Z production 



at LHC, with 10 TeV center of mass energy. A disagreement in various observables, especially the Z 
transverse momentum, -pT, between AlpGen and Pythia 6 was recently reported in |66|. Namely, the 
shape in the low Z pT region was significantly different when comparing AlpGen showered with Pythia 6 
and matrix element corrected Pythia 6. This was traced back to a change in shape in Pythia 6 when matrix 
element corrections are switched off (this setup is used when showering AlpGen events, because higher 
multiplicity AlpGen samples already provide higher order corrections). This is summarized by the Z pT 
plot shown in Fig. [9] A bug in Pythia 6 causing this behavior was later corrected; here we present results 
obtained with the new version of Pythia 6. 

We also compare different Pythia 6 tunes, checking both the virtuality ordered and the newer transverse 
momentum ordered shower. 

Using Pythia 6 as a reference we check the performances of the matching in AlpGen and Sherpa. Since 
Pythia 6 is fully corrected for first order real emission, we expect agreement with AlpGen and Sherpa 
when those are configured to include matrix elements for Z plus up to one additional parton. 
Results presented in this work are obtained using Pythia 6 versions 6.41 1 and 6.421 (the last contains the 
bug fix for the matrix element corrections problem), AlpGen 2.13 and Sherpa 1.2.0. 



7.2 ANALYSIS SETUP 

All generators have been processed through the same analysis, selecting events with a Z with mass between 
66 GeV and 116 GeV, requiring the two leptons to have pT greater than 20 GeV and pseudorapidity 
between -2.5 and 2.5. Jets are reconstructed with the k± algorithm, with a radius of 0.4 and a minimum 
Pt of 30 GeV. In order to decouple effects due to multiple interactions, hadronization and QED radiation 
off leptons, these have been switched off for all the generators used. 

The analyses were carried out using Rivet (see Section[S]l. 

'^^Contributed by: P. Lenzi, P. Skands 



PYTHIA ME-corr 
PYTHIA no ME-corr 
AlpGen Q =20GeV 
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Fig. 9: Z transverse momentum spectrum for Pythia 6 (corrected and uncorrected) and for AlpGen plus 
Pythia 6 as in [66] . AlpGen plus Pythia 6 follows uncorrected Pythia 6 at low pT- The change in shape of 
uncorrected with respect to corrected Pythia 6 was responsible for the disagreement. 
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7.3 MATRIX ELEMENT CORRECTIONS IN PYTHIA 



Matrix element corrections in Pythia 6 are described in detail in 1 18 1. Here we just remind the basics of the 
procedure. Pythia 6 modifies the shower in two steps: first the starting scale is raised to the hadronic center 
of mass energy (power shower configuration) so that any hard emission from the shower is kinematically 
possible; then the probability for the first emission (which is also the hardest in Pythia 6) is re-weighted to 
include the first order real emission contribution. The reweighting factor is calculated as the ratio between 
the exact real emission matrix element and the splitting function used in the shower. 



The effect of matrix element corrections in Pythia 6.41 1 is summarized in Fig. 10 for Z transverse 
momentum spectrum. The three curves are obtained with full matrix element corrections, with power 
shower only, and with uncorrected shower with starting scale at the Z mass. The relative difference 
with respect to the ME corrected result is shown in the lower plot. The shape of the spectrum changes 
significantly when matrix element corrections are switched off, even at low -pT (the relative difference plot 
does not flatten as goes to 0). This is unexpected, as the matrix element effect is supposed to go to zero 
at low pr, where the description from the shower is already accurate. In fact, the splitting function used in 
the shower is the approximation for pj- — of the of the exact matrix element for emission of one parton, 
so no difference between the two descriptions is expected in that region.The effect of matrix element 



corrections is illustrated in Fig. 1 1 for the newer 6.421 version of Pythia 6. The same collection of curves 
as for version 6.411 is shown. With the newer version the deactivation of matrix element corrections has a 
very small effect at low p-p, as expected. A bug was found and corrected between the two versions which 
was causing the behavior observed in the earlier version. 

7.4 PYTHIA TUNES 

We compared various tunes in Pythia 6.421 on the basis of three observables, the Z transverse momentum, 
the jet multiplicity and the leading jet transverse momentum. In all of our comparisons we switched off 
the multiple interactions simulation, which means that we are comparing the effect of different tunings on 
the hard event simulation and on the shower 



We compared the following tunes: DW |67|, D6T |67|, Pro-QO |68|, PerugiaO |69|. The virtuality 




20 40 60 ^iO 100 120 140 

[GeV] 



Fig. 10: Z transverse momentum spectrum for three different matrix element correction settings in 
Pythia 6.4 1 1 : with matrix element corrections, with power shower only, without matrix element corrections. 
The relative difference with respect to the matrix element corrected curve is shown in the lower plot. 
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Fig. 11: Z transverse momentum spectrum for three different matrix element correction settings in 
Pythia 6.421: in this new version, deactivating matrix element corrections has a small effect at low pr, as 
expected. 



ordered shower is used in the first three, while the last uses the px ordered shower. 



Z transverse momentum spectra obtained with the four mentioned settings are shown in Fig. 12 



The agreement is generally good, tune DW and D6T agree fairly well on the whole spectrum; some 




discrepancies are observed in the low pT region: both tune PerugiaO and tune Pro-QO differ from Tune 
DW, and they also differ from each other. The disagreement with respect to tune DW is up to 20%. 



Jet multiplicity and leading jet transverse momentum spectrum are shown in Fig. 1 3 The jet multiplicity 
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for Tune PerugiaO shows some not negligible differences with respect to the other tunes, predicting more 
jets. On the other hand Tune DW, D6T and Pro-Q20 predict very similar rates. Concerning the leading jet 
Pt spectrum in Fig. 13 (b), the shape predicted by the four tunes is very similar. 




Fig. 13: (a) Jet multiplicity, (b) leading jet p-p spectrum for four Pythia 6 tunes. Relative difference is 
shown with respect to Tune DW. 



7.5 Pythia 6, Sherpa, AlpGen COMPARED 

In this section we compare Pythia 6 (including full matrix element corrections) with AlpGen and Sherpa. 
We used Pythia 6 version 6.421, both standalone and to shower AlpGen events. Also, we used tune DW 
for Pythia 6. 

We used matrix element corrected Pythia 6 as a reference for a consistency check of the matching 
prescriptions used in AlpGen and Sherpa. Both the CKKW |4| prescription used in Sherpa and MLM 
prescription used AlpGen [64} 70 1 can describe multiple parton emission corrected for the corresponding 
multi-parton matrix element. If just one additional parton emission from the matrix element is permitted, 
those prescriptions should give results compatible with Pythia 6. 

While CKKW and MLM divide the phase space in a region of jet production, populated by the matrix 
element, and a region of jet evolution populated by the shower, the matrix element correction prescription 
implemented in Pythia 6 does not depend on any separation parameter, thus providing us with the "correct" 
reference to test the other matched calculations. 



The comparison among the three generators is shown in Fig. 14 for the Z transverse momentum. The 
agreement in shape is generally good, well within 20% with respect to Pythia 6 for most of the spectrum. 
In paiticular, the mismatch between AlpGen and Pythia 6 observed in [66] is practically gone with the 
newer version of Pythia 6. This is an important result also because it establishes the re-usability of Pythia 6 
tunes between standalone Pythia 6 and AlpGen plus Pythia 6 shower. 



Jet multiplicity and leading jet px spectra are shown in Fig. 15 In (a) both AlpGen and Sherpa appear 
to have a longer tail for high jet multiplicity. Concerning the leading jet pT spectrum in (b), Sherpa 
follows Pythia 6 all over the spectrum, AlpGen appears so have a harder tail for high px- 



Differential jet rates for the 1— s-O, 2— s-l and 3^2 transitions are shown in Fig. 16 Differential jet 
rates for the transition n — )• n — 1 are the distributions of the Ut resolution parameter in an exclusive 
kx algorithm, Qn~^n-i, that makes an n jet event turn into an n — 1 jet event. AlpGen and Pythia 6 are 
similar, while Sherpa's shape shows some not negligible differences. Sherpa uses a dipole shower, while 
Pythia 6 uses a virtuality ordered shower (the same shower is also used with AlpGen). The source of the 
difference is likely in the different shower, which fills the phase space with a different pattern. 
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(a) (b) 
Fig. 15: (a) Jet multiplicity, (b) leading jet pT spectrum. 



— ^ Pythia tune DW 
— — Sherpa 1.2.0 
Alpgen+Pythia 



(a) 



Pythia tune DW 
— Sherpa 1.2.0 
Alpgen+Pythia 



(b) 









Pythia tune DW 
— Sherpa 1 .2.0 






"'hi. 


Alpgen+Pythia 














•t 








\ 










t 


8:1 

























(c) 





Fig. 16: Differential jet rates: (a) 1-^0, (b) 2-^1, (c) 3-^-2. 



7.6 CONCLUSION 

We studied matrix element corrections in Pythia 6 for inclusive Z production at the LHC and we compared 
Pythia 6, AlpGen and Sherpa on the ground of first order real emission corrections. A bug was recently 
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discovered and corrected in inclusive vector boson production in Pythia 6, that made the effect of matrix 
element connections very big also at low boson px- We compared two versions of Pythia 6, before and 
after the fix, and observed that the low boson momentum region is correctly described in the new version, 
the matrix element corrected and uncorrected shower being similar in that region as expected. 

We compared four different Pythia 6 tunes. For all of them we switched off the multiple interactions 
in order to observe the effect of the tune on the shower and on the hard event simulation. Generally 
good agreement was observed, with some differences especially in the low Z pT region and in the jet 
multiplicity spectrum. 

We showed various plots comparing Pythia 6, AlpGen and Sherpa. Differences between AlpGen plus 
Pythia 6 and matrix element corrected Pythia 6 observed in 1 66] were due to the bug affecting Pythia 6, 
and are now much reduced. This also assesses the re-usability of Pythia 6 tunes when Pythia 6 is used to 
shower AlpGen events. 

7.7 PARAMETERS AND SETTINGS 

In Pythia 6, the underlying event was switched off with MSTP ( 81 ) =0 (MSTP ( 81 ) =20 for the tune 
PerugiaO, that uses the pT ordered shower). Matrix element corrections were controlled with the parameter 
MSTP { 6 8 ) : this was set to =0 to switch off matrix element corrections and to set the staring scale to the 
Z mass, to =2 for the power shower without matrix element corrections. The default had matrix element 
corrections enabled. Hadronization was switched off with MST J ( 1 ) =0. The various tunes were selected 
by acting on parameter MSTP ( 5 ) . 

AlpGen samples were produced with a minimum parton pT of 20 GeV. We used the default parameters 
for Sherpa, with no underlying event, no fragmentation, no QED radiation off leptons. 
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Part III 

BEYOND FIXED ORDER 



8. MULTIPLE PARTON INTERACTIONS AS A BACKGROUND TO TOP PAIR PRODUC- 
TION E3 
8.1 INTRODUCTION 

The occurrence of Multiple Parton Interactions (MPI) in hadronic collisions is by now well established 
experimentally |71-74|. Furthermore, they represent the basic mechanism in the description of the 



Underlying Event (UE) in the standard Showering Monte Carlo's (MC) like Pythia 6 ||69||75]|76p and 
Herwig [54[|77||. MPI rates at the LHC ai^e expected to be large, making it necessary to estimate their 
contribution to the background of interesting physics reactions. On the other hand, their abundance makes 
it possible to study MPI experimentally in details, testing and validating their implementation in the MC. 

In this note we discuss the role of MPI in top-antitop production at ^/s = 10 TeV. The LHC will 
be a top-antitop factory and the large rate will allow accurate measurements of the top mass, one the 
most crucial parameters for stringent tests of the SM, and production cross section. It will therefore be 
of extreme importance to have full control over all potential background processes. In this study we 



compare the results obtained with Pythia 8 |20 1 with those obtained at parton level with the methods of 



Refs. 1 78 79| which suggested that MPI could provide a significant background to top-antitop production 



and to other interesting processes like W + Aj and Z + 4j production. 



8.2 SETUP 

If no 6-tagging is assumed, the MPI processes which provide a background io ti and more generally 
contribute to + 4j through Double Parton Interactions (DPI) are: 



jj(g)jjW, jjjCSijW, jjjj(S)W (8) 

where the symbol stands for the combination of one event for each of the two final states it connects. 

Besides in W decay, isolated high-p^ leptons can also be produced in the decay of heavy quarks. 
Therefore any reaction in which heavy quarks plus jets are produced can fake the signature of a W boson. 
Keeping in mind the large QCD cross-section for bb production, heavy quark and QCD DPI processes 
deserve attention as possible backgrounds to top pair production. This corresponds to the following extra 
processes: 



bb(g)jj, bbj(g)jj (9) 

In this study we have decided to use Pythia 8 for simulating DPI events. Pythia 8 allows to use external 
Les Houches Accord |3] input for the hardest process, and to choose an internal one for the second 
hardest, being however limited to 2 — )• 2 processes. This set-up allows a precise study of dedicated DPI 
configurations. 



Moreover we have generated a number of samples, most notably W + 2j, with MADEVENT 1 80 81 1, 



for which DPI has been simulated by either Pythia 8 or by combining pairs of parton-level events as in 



Refs. 178 791 . 



In summary, these are the samples that we have generated with Pythia 8: 

tt, Wj0jj, Wjj{MG)(^jj, bb(^jj, bbj{MG)(^jj (10) 



Contributed by R. Chierici, E. Maina 
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where with Wjj{MG) and bbj{MG) we indicate that the corresponding events have been produced 
starting from an external Les Houches file created by MADEVENT . In all cases the second hard interaction 
switched on in Pythia 8 is a generic 2 — 2 QCD process. The default Pythia 8.130 parameter setup 
and tunings have been used. All the events processed with Pythia 8 have been fully showered, and jet 
clustering run over the events to have the possibility to apply a more realistic event selection. 

We have also generated at parton level all reactions contributing to jj (g) jjW, jjj jW and jjjj W 
with MADEVENT. 



All samples have been generated using CTEQ6L |82| parton distribution functions and with the 
following parton level cuts: 

PT^ > 10 GeV, \7]j\ < 5.0, ARjj > 0.001 (11) 
where j = u, u, d, d, s, s, c, c, b, b, g. 

The DPI cross section for the parton level samples has been estimated as 

(J = ai-a2/(yeff/k (12) 

where ai , a2 are the cross sections of the two contributing reactions and A; is a symmetry factor, which 
is equal to two if the two reactions are indistinguishable and equal to one when they are different. 

At the Tevatron, CDF |73 1 has measured cje// = 14.5 ± 1.7]^2 3 "i^, a value confirmed by D0 which 



quotes o"e// = 15.1 ± 1.9 mb f74|. In Ref. [83 ] it is argued, on the basis of the simplest two channel 
eikonal model for the proton-proton cross section, that a more appropriate value at =1.8 TeV is 10 
mb which translates at the LHC into (^eff'^ = 12 mb. Treleani then estimates the effect of the removal by 
CDF of Triple Parton Interaction events from their sample and concludes that CDF measurement yields 
o"e// ~ 11 mb. In the following we use a^ff = 12.0 mb with the understanding that this value is affected 
by an experimental uncertainty of about 15% and that it agrees within about 10% with the predictions of 
the eikonal model. 

The DPI picture in Pythia 8 assumes that interactions can occur at different p^- values independently of 
each other inside inelastic nondiffractive events. The expression for a DPI cross section becomes therefore: 



=< fimpact > 0-i-a2/(TND/k (13) 

where crTvi) is the total non-diffractive cross section and /impact is an enhancement/depletion factor chosen 
event-by-event to account for correlations introduced by the centrality of the collision. This quantity is 
typically averaged during an entire run to calculate < /impact > in Eq. 1 3 Typical values at the centre of 
mass energy of 10 TeV are 1.33 for < /impact > and 51.6 mb for a]\fD- Comparing Eq. 13 with Eq. 12 



tells us that Pythia 8 sort of predicts an effective (Jeff=(JNDl< /impact > which is about a factor 3 larger 
than the one actually measured at the Tevatron. 

Table |4]reports the cross section breakdown for the samples used in this study and processed by Pythia 8. 
All cross-sections are leading order as returned by the programs used. The Wjj sample generated with 
MADEVENT has also been combined with matrix elements QCD events, evaluating the cross section 
according to Eq. [12} we will call this sample as Wjj (E> jj ME, as opposed to the Wjj (g) jj P8 sample. 



8.3 RESULTS 

To mimic an experimental analysis, showered events are first required to have an isolated, central and 
energetic lepton according to the cuts: 

PT, > 20 GeV , \rji\< 2.5 , Eiso < 20 GeV (14) 

with obvious notation and where Eiso is an isolation variable defined as the total energy flowing in a 
cone of opening angle AR=0. 1 around the lepton. Later in this study we have made this cut more severe 
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Process 


o-(nb) 


cr(«>ji)(pb) 


O-acc(pb) 


tt 


0.24 


no DPI 


60 


lu)jj{MG) 


13.7 


140 


23 


Wj 


4.8 


12 


2.5 


bb 


850 


2200 


18 


bbj{MG) 


688 


7300 


69 


bbjjiMG) 


472 


no DPI 


4012 


liy)jj{MG) 


13.7 


no DPI 


627 



Table 4: Total cross-sections for the processes studied in this contributions. Samples labelled with MG 
are made with the use of MADEVENT. The second column presents the cross sections as returned by the 
generators, the third column the corresponding cross sections after having added a second hard interaction 
to the process in Pythia 8, and the last column the accepted cross sections after the selection described in 
Eq.fT4l 



by increasing AR to 0.5. We will also compare the effect of different jet px thresholds. To be closer 
to an experimental selection, only electrons and muons are considered in this study. In events with a 
candidate lepton, at least four jets with transverse momentum above 20 GeV and \r]\ < 5 are requested. 
No b-tagging requirement is mimicked. The accepted cross sections of the top-pair signal and the DPI 
backgrounds are reported in Table |4] In comparison we have also reported the accepted cross sections for 
two equivalent non-DPI background processes. 

It has to be mentioned that for the DPI events obtained by the simple combination of parton level 
events as in Refs. |78j7^, where no parton shower and fragmentation is present, the above cuts have been 
replaced by equivalent ones. The lepton isolation, for instance, becomes a cut in AR between the lepton 
and any jet. 



In Figure 17 we present a few distributions at generator level after the event selection in Eq. 14 We 
decided to compare the top-pair signal with the Wjj process alone and with DPI Wjj (8) jj in both the 
ME and P8 configurations. All histograms in the plots are normalised to the number of events expected 
in 100/pb of integrated luminosity, with the exception of the Wjj contribution that has been scaled by a 
factor 10 to make the plot more readable. The plots shown represent, respectively, the highest and softest 
(among four) px of the jets in the event, the maximum pseudorapity difference among the four jets, the 
azimuthal difference between the jets complementary to those with the maximum azimuthal difference, 
the scalar sum of the transverse momenta of the four highest px jets, and the transverse momentum of the 
found lepton. The heavy quark DPI component is shown only for this last histogram. The plot is a good 
indication of the fact that kinematic cuts on the lepton transverse momentum, as well as a more severe cut 
on the lepton isolation, should be very effective in removing the heavy quark background(s), as we will 
also demonstrate later. 

In the plots we have not superimposed several ISR components that are, as expected, largely dominating 
in all the corners of the phase space (for instance Wjjjj with respect to Wjj (g) jj DPI or bbjj with 
respect to bb (g) jj DPI). Therefore, analyses able to reach high purities in the top signal over the ISR 
background, should also be able to highly suppress all the analogous signatures coming from DPI events. 

The DPI contributions can however reach sizeable fractions of the ISR backgrounds for analyses with 
non sufficiently tight cuts. In the figures, for instance, we compare Wjj{+paiton shower) rescaled by a 
factor 10 with the Wjj (8) jj DPI component. The rate of the latter is sizeable and the two have similar 
kinematic characteristics: the additional jets from the second interaction have the tendency to be softer 
and slightly more forward in the acceptance. This suggests that harder cuts on the jet transverse momenta 
should also be very effective in reducing the DPI component. From the figures it also turns out that there 
are significant discrepancies in the normalisation, as well as in shape, for the prediction of the Wjj jj 
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Fig. 17: Distribution of the highest ; the fourth highest px^ ; the largest separation in | Ar/I among the 
four highest pj- jets; the | A(/)| of the two jets complementary to those with the largest separation in | A0| 
among the four highest p-p jets; the scalar sum of the p^^'s of the four highest p^- jets; the p^- of the charged 
lepton. The plots are made with the cuts in Eq. 14 and for 100 pb~^ of integrated luminosity at the LHC 
at ^ = 10 TeV. 
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Fig. 18: Distribution of the highest px ', the fourth highest pT ', the scalar sum of the p^'s of the four 



highest Pt jets; the px of the charged lepton. The plots are made with the cuts in Eq. 14 and a more severe 
lepton isolation condition, and for 100 pb^^ of integrated luminosity at the LHC at ^/s =10 TeV. The 
lepton isolation cone has now an opening of AR=0.5. 



DPI ME and P8 components. The difference in normalisation is a direct consequence of the different 
normalisation cross section for the DPI component in Pythia 8 with respect to the model in |83| and the 
Tevatron measurements. The relatively small shape difference can everywhere be ascribed to the fact that 
the distributions for Wjj jj ME are made at parton level prior to any showering, whereas for Pythia 8 
they are all at hadron level and, therefore, also account for shower radiation. 

The conclusions drawn so far have a very mild dependence on the lepton selection, with the exception 
of the reduction of the heavy quark component, for which a tighter isolation requirement is mandatory. 
The plots in Figure [18] correspond to those in Figure 17 with the exception that the isolation cone for 
the lepton has now an opening of AR=0.5. Shape and normalisations for the signal and the W+jets 
backgrounds do not change significantly, whereas the contribution of the heavy flavour component is 
significantly reduced. 

Another way to tighten the analysis, and protect the signal even further from the DPI component is to 
increase the jet transverse momentum threshold. Figure [19] shows the scalar sum of the four highest pr 
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Fig. 19: Distribution of the scalar sum of the p^'s of the four highest pr jets and the fourth highest px^ - 
The plots are made for 100 pb~^ of integrated luminosity at the LHC at ^/s = 10 TeV. The lepton isolation 
cone has now an opening of AR=0.5 and px > 10 GeV. 



jets transverse momenta and the p^ of the fourth jet in the event after that both lepton isolation conditions 
and jet thresholds have been tightened. The latter has been moved from 20 GeV to 40 GeV. The top pair 
signal component is now scaled by a factor 10, making it evident that all DPI component are significantly 
reduced. What is also interesting is that in this region of phase space the accepted cross section (total and 
differential) for Wjj (>S> jj ME and P8 are essentially equivalent. This indicates that either approach is 
equivalent in describing DPI at relatively high transverse momentum. 



8.4 FURTHER RESULTS 



In Sect. 8.3 we have compared the Wjj jj parton level DPI contribution to a number of contributions 
generated with Pythia 8. We have neglected the jjj ® jW and jjjj (g) W components because they 
cannot be directly generated only with Pythia. In this section we complete our analysis in this regard. In 
Figure 20 we present the distribution of the largest jet transverse momentum for jj jjW, jjj jW 
and jjjj (g) W. We also show the Single Parton Interaction (SPI) result for W + 4j and the sum of SPI 
and MPI contributions. We neglect the small Triple Parton Interaction component. All results in this 
section are purely at parton level without showering and hadronization. The selection cuts are: 



PT, > 20 GeV , Irjjl < 5.0 , ARjj > 0.5 , pT, > 20 GeV , {rjil < 2.5 , ARji > 0.1 (15) 

The W + Aj/Wjj ®jj/ jjj ®jW/j jjj 0W contributions are in the ratio 1/0.33/0.11/0.04 and therefore 
the DPI rate is about half of the SPI rate, with the Wjj jj component contributing about two thirds of 



the DPI total. As a compaiison we recall that in 1 78 1 the DPI contribution toW + 4j was estimated at 



about 10% at = 14 TeV. Only part of the discrepancy between the two results can be explained by the 
smaller effective cross section aeff = 12.0 mb employed here in comparison to o-g// = 14.0 mb used 
in ||78]. Therefore our preliminary analysis suggests that the DPI background to + Aj production is 
proportionally larger at ^/s =10 TeV than at 14 TeV. 

CONCLUSIONS 

We have asked ourselves the question whether MPI can be a source of unexpected background for high p^ 
physics at the LHC. In this study we have taken top production as a perfect use case, trying to determine 
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Fig. 20: Distribution of the highest p^, for jj (E> jjW, jjj (S> jW and jjjj (S> W. Also reported are the 
SPI result for W + 4j and the sum of SPI and MPI contributions. 



which MPI background can induce a signature compatible with ti. We have shown that DPI coming from 
QCD processes and either W production or heavy quark production can indeed fake a top pair signature, 
becoming a significant fraction of the remaining background if the analyses are not sufficiently tight. 
As main experimental cuts for reducing drastically DPI we suggest the minimum transverse momentum 
threshold for jets and the lepton isolation, which are expected to be also effective for standard multi-jet 
backgrounds. In our study we have compared two ways of generating DPI events: one based on Pythia 8 
and the other by direct combination of parton level events. The differences between the two approaches are 
important, especially in the softer regionss of phase space and for what concerns the global normalisation, 
and are ultimately due to the assumptions on the cross section to which DPI are normalised. The present 
results suggest that the DPI background to 1^ + 4j production is proportionally larger at 10 TeV than at 
14TeV 
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9. A MATCHING SCHEME FOR W-f NLO MATRIX ELEMENT GENERATOR AND Pythia 8 
PARTON SHOWERED 

9.1 INTRODUCTION 

Diboson production is among the early physics topics to be studied at the LHC initial phase. In particular, 
the W'j process has a higher rate compared to the others, which can be pursued at moderate luminosities 
to probe the gauge structure of the Standard Model. The presence of anomalous couplings between the 
W-bosons and the photon is expected to enhance the harder part of the photon transverse momentum (p^ ) 
spectrum which is one of the experimental observables. Higher order QCD corrections will contribute 
significantly at the LHC, modifying the spectrum and hence may obscure the effect of any anomalous 
coupling that might exist. This calls for at least a NLO QCD event generator to accurately describe the 
parton level process. However, for a full event description including the initial and final state radiations 
(ISR and FSR) and the underlying events the parton shower approach is required. In this contribution we 



discuss a matching scheme for Wj events generated by Wgamma_NLO | 84 1, which is a complete O (a 



calculation, to the Pythia 1 18 1, |[20l parton showers. The proposed methodology for the matching preserves 



both the rate of the hard scattering process as well as various kinematic distributions of experimental 
interest, like the photon transverse momentum. 

9.2 THE MATCHING STRATEGY 

The Wgamma_NLO package | f84| , [85) is a matrix element-based event generator which produces 
events in hadronic collisions with up to one additional parton (quark or gluon) in the final state. It has the 
provision for the introduction of anomalous WWj couplings in addition to Standard Model ones and 
hence interesting for experimental analysis. 

The Wgamma_NLO package produces two types of event topologies: 

• 3-body final state, from +0 jet events, containing the lepton and the neutrino from W-decay 
and the photon; 

• 4-body final state, from +ljet events, containing an additional parton (quark or gluon) forming 
a jet. 

Figs. 21 and[22]depict the Bom level diagrams and some of the higher order QCD corrections for the W'j 



process respectively. Fig. 23 compares the pj, spectrum at the Born level to that at NLO, as obtained from 
Wgamma_NLO . The cross-section for the W-f process is 1.390±0.029 pb at NLO and 0.584± 0.005 pb 
at Bom level for > 25 GeV/c with < 2.1 and p^^*"" > 15 GeV/c with |r/'^P*°"| < 2.6, r/^ and 
^lepton ]jQiYig the pseudorapidity of the photon and the lepton from W-decay respectively. The selection 
applied here are typical of the LHC experiments. This gives a pj, -dependent k-factor with a mean value 



of of 2.16, which is quite substantial (Fig. 23 inset). 

Pythia 8 has been used for implementing the p^-ordered parton shower to the hard-scattered part 
generated by Wgamma_NL0 . The ISR process in Pythia adds coloured partons to the incoming quarks 
of the hard scattering process which may transform a 3-body event to a 4-body event, thereby effectively 
changing the cross-section of the individual 3-body and 4-body processes. The idea behind the matching 
scheme is to generate the parton showers without changing the number of individual events; i.e., the 
parton shower should not populate the regions of phase space for the Wjj events, j being a parton, which 
are already filled by the matrix element generator. 

It is to be noted that Wgamma_NL0 package produces weighted events which has to be unweighted 
before matching where only events with unit weight are considered. The unweighting is done using the 
standard "hit-and-miss" technique, where the ratio of an event weight to the highest weight is compared 
against a number drawn from a uniform deviate and the event is kept if the ratio is greater than the random 
number generated from the uniform distribution. Events with an occasional negative weight, from virtual 
loops, are not considered. 

" Contributed by: D. Majumder, K. Mazumdar, T. Sjostrand 
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Fig. 21: Bom level subprocesses for production in hadron-hadron collision : s-channel process (left), 
t-channel process (middle) and u-channel process (right). 
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Fig. 22: Some higher order QCD diagrams for Wj production in hadron-hadron collision. 



9.21 The matching algorithm 

In the matching procedure, the discrimination of 3-body vs. 4-body state is performed according to a pre- 
defined threshold value pf^f^''^'*^ (^5 GeV/c for the present study), which serves as the boundary between 
the matrix element regime on its upper side and the parton shower regime on its lower. The matching 
procedure starts with projecting the 4-body system to a 3-body system, by assuming that the outgoing 
parton can be emitted from either of the incoming partons (flavours permitting), with relative weights 
determined by splitting kernels and parton densities. Effectively this corresponds to the assumption 
that the 4-body state never had a parton emitted and all the kinematics are recalculated based on this 
assumption. Subsequently, this projected 3-body final state event is treated according to the following 
steps: 

1 . Shower the projected 3-body event and compare p^°^^'^ at the first ISR branching with the ^j^"^*"" 
in original 4-body: 

• If p^<^<^'^ > p^'^^°" then the event is reclassified to 3-body; move to step 2 below. 

• If ps^o^e^ < then the original 4-body event is retained as a 4-body event; move to 
step 3 below. 

2. Shower the 3-body events: 

• Compare p^°^'^'^ with pf^P"-'^'"-*^ after first ISR branching, pf^^"-''"-^^ can be considered to be 
the boundary between the ME calculation's regime and that of the parton shower's. 

• If p?^"^'^'^ > p^epar-ate ^^^^ further shower evolution and restart the parton shower 
until pf'^^'- < p^'^"''"*". 

• Continue with the rest of the shower. 

3. Shower the 4-body events: 

• The pf"'^^'' from the first ISR is compared with If > then restart the 
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Fig. 23: Differential cross-section for the photon transverse momentum at Bom level is compared with 
the differential cross-section for NLO W'y process. Inset: k-factor as defined by the ratio of the NLO 
cross-section to the Born cross-section. 



ISR branching until pf^^^ < 
• Continue with rest of shower to give a complete event. 

9.22 Matching results 

The 4-body final states from the matrix element calculation lacked the Sudakov form factor for QCD 
emission which was corrected for by the above-mentioned algorithm. It is to be noted that the first ISR 
emission from the 3-body events are always confined below pf^^"'^"'^'^ . But this is not so in the case where 
the parton showering is only required to be softer than the matrix element parton. 



Fig. 24 (left) shows the distribution of the transverse momentum of the W'y system obtained by 



Wgamma_NLO superposed with the same after the Pythia showering. The distribution from Wgamma_NLO 
shows many events with equal to zero which are the 3-body events. The non-zero values correspond 
to events with a parton in the final state. After the matching, the kink after the zeroth bin fills up due to 
the boost from Pythia ISR. The area under both these curves however remain the same indicating that 



the exclusive cross-section of the 1-jet events remain conserved after the parton shower. Finally, Fig. 24 
(right) shows the photon pT spectrum before and after the matching. 

The matching scheme is rather simplistic and is suitable for this particular event topology which 
contains only one hard jet in the final state. The advantage is that it does not require any modification 
of the matrix element calculation of Baur for the Sudakov form factor needed for the parton emission. 
This is in contrast with packages based on leading order multi-paiton final state calculation, where the 
matching scheme with parton shower effectively reduces the cross-section. 

9.3 SUMMARY 

The matching strategy as enunciated above preserves the cross-section of the + 1 jet events. Also, for 
a reasonably high cut on pj, , the spectrum before and after the matching is identical. It remains to be seen 
how stable the scheme is with respect to changes in the boundary between the matrix element and parton 
shower, as defined by pf^P"-"^"-^^ and also its performance in comparison to matched events generated by 
other matrix element generators like Madgraph and Alpgen. The final test of the best suited method can 
be performed only with good-statistics real-collision data at LHC. 
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Fig. 24: The transverse momentum of the system before Pythia showering and after Pythia showering 
(left) and photon px spectrum before and after the showering (right). 
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10. THEORY TESTS OF PARTON SHOWERS B 
10.1 INTRODUCTION 

The last few years have witnessed significant progress in the improvement of parton shower algorithms 
with the invention of new shower algorithms based on the dipole or antenna picture |[86 - 92 1 known 



from next-to-leading order calculations. These algorithms are able to satisfy simultaneously at each step 
momentum conservation and the on-shell conditions. This is possible, because they are based on 2 — 3 
splittings, where the spectator can absorb the recoil. Within the traditional 1 — >• 2 splitting algorithms it is 
impossible to satisfy simultaneously momentum conservation and the on-shell conditions for a splitting. 
In addition these new algorithms implement in a correct way simultaneously the soft and the coUinear 
limit of the matrix elements. When both emitter and spectator are in the final state these new algorithms 



are very similar to the shower algorithm implemented in Ariadne |93 -97 |. The new algorithms extend 
the dipole or antenna picture to final-initial, initial-final and initial-initial configurations. 

For any shower algorithms one needs an evolution variable. A sensible choice is 

t = In^, (16) 

where is a fixed reference scale and k± is the transverse momentum of a splitting. During the shower 
evolution we move towards smaller (more negative) values of t. The central object of a shower algorithm 
is the Sudakov factor, describing the no-emission probability. For a dipole shower it is calculated from the 
individual dipoles. For a dipole with emitter i and spectator k, the Sudakov factor is given by 

\j,k{tl,t2) = exp ^ J dtC'-j^ J d(punres6 (t - T-.j^^ Vij^k^ , (17) 

where ^ is a colour factor. Tj depends on the dipole invariant mass (p| + P^)^ and the phase space 
variables for the emission of an additional particle. The essential information is given by the function 
Vij^k, which encodes the singular part of the matrix elements for the emission of a particle. As an example 
we quote this function for the g — )- ggi splitting: 



[Pi+Pk) y 



l-z{l-y) 



(1 + ^) 



(18) 



as is evaluated at the scale = —k'j_. The shower algorithm proceeds through the following steps: One 
first chooses the next scale t at which a splitting occurs according to the Sudakov factor. For this given 
choice of t one then chooses the momentum fraction z according to Pa^bc{z) and an azimuthal angle if 
either uniform or according to spin-dependent splitting functions. With these three variables (t, z, Lp) one 
can construct all four-vectors after the emission. The spectator absorbs thereby some recoil. We now have 
a configuration where one particles was emitted. For t > tmin one goes back to the first step, otherwise 
one stops. 



10.2 EVOLUTION VARIABLES 

In the last year there has been some discussion whether the new shower algorithms based on dipoles or 
antennas |98- 100| have the correct collinear limit. In particular it has been argued that the fact that the 
spectator takes some recoil could conflict with collinear factorisation. This question has been solved 
and it has been shown that the algorithms based on dipoles or antennas describe correctly the collinear 
limit 1 99 100| . This result could have been anticipated from the fact, that the dipole splitting functions 

"^Contributed by: S. Weinzierl 
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Pa-^hc reduce in the collinear limit y — > to the Altarelli-Parisi splitting functions. For the example of a 
q — )• g^-splitting we have 

87ras(//^) 1 



lim V, 



Cf- 



1 - z 



[l + z] 



(19) 



In addition the momentum mapping 

Pi = zpi + k± + y {I - z) p-^, pj = (1 - z)p. -k±+ yzp-^, = (1 - y)p-^, (20) 
reduces in the collinear limit y — )• and fc^ — )• to 



Pi = ZPI^ Pj = (1 - z)Pi^ Pk = Pk- 



(21) 



In particular, the spectator does not receive any recoil in the collinear limit. What emerged from the 
discussions is that the choice of the shower evolution variable is not arbitrary. This lead to the definition 
of an "infrared sensible" evolution variable | ,100] . The definition of infrared sensible is that both infinitely 
soft and collinear emissions should be classified as unresolved for any finite value of the evolution variable. 



The evolution variable based on the transverse momentum as in eq. ( 16 1 is an infrared sensible evolution 
variable. On the other hand, the choice of the energy of the emitted particle (in the centre-of-mass system 
of the emitter and the spectator) is not infrared sensible. There are configurations where the emitted 
particle has finite energy, but which are infinitely collinear. 



10.3 MULTIPLE SOFT EMISSIONS 



The shower algorithms above are derived from first-order perturbation theory. The singular functions 
entering the Sudakov factor are derived from tree-level matrix elements, where a single particle becomes 
either soft or collinear. The parton shower therefore reproduces the leading-log behaviour of the matrix 
elements in the limit of an infinitely large hierarchy between the scales of each successive emission. No 
claim is made to describe correctly the case of multiple emissions at the same scale. Nevertheless I would 
like to make a few comments what could be expected from future algorithms incorporating these effects. I 
will discuss multiple soft emissions, as derived from the singular behaviour of the matrix elements. If a 
single gluon becomes soft, the square of a partial tree amplitude factorises according to 
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In any strongly ordered limit the emission of r soft gluons is described by 1 101 1 
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where a is permutation of (1, 2, r). However, if two gluons are emitted at the same scale Ei w E2 
we have a more complicated formula. In the limit of two soft gluons a partial tree amplitude factorises 
according to | |102| 
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The expression in the square bracket does not contribute to any strongly ordered hmit (£'1 <^ E2 <^ Ea, Eh 
or E2 <^ El <^ Ea, Ejy). In the strongly ordered Umit it is less singular than the first term. However for 
Ei,E2 <^ Ea, Ejj both terms scale like 1/A^, if the momenta of the soft gluons are rescaled by A. 
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11. HIGGS BOSON PRODUCTION VIA GLUON FUSION AT THE LHC: A COMPARATIVE 

STUDY [3 
11.1 INTRODUCTION 

The primary motivation for ongoing and impending physics programmes at the Tevatron and LHC is to 
gain insight into the nature of electroweak symmetry breaking. The great majority of the effort in this 
direction is devoted to the hunt for the Higgs boson, the origin of this symmetry breaking in the Standard 



Model I IT03H106J . 



Of all the ways in which the Standard Model Higgs boson can be produced, the gluon fusion pro- 



cess 1 107 1, in which it couples to colliding gluons via a top quark loop, has the largest cross section 
for Higgs boson masses less than ~700 GeV. This process will be extremely important in detecting and 
studying the Higgs boson at the LHC in the low mass region, favoured by fits of the Standard Model to 
electroweak precision data [ 108] and in part also by direct searches at the Tevatron [ 1 09„110J , for which 
Higgs boson decays into two photons are expected to give a clean experimental signal. 

Although observing a narrow resonance in the diphoton invariant mass spectrum should be possible 
using only the experimental data [111], determining the quantum numbers and couplings of the resonance 
i.e. determining that it really is a fundamental scalar, in particular, the Standard Model Higgs boson, 
requires Monte Carlo simulations to predict distributions for both signals and backgrounds. 

In recent years Monte Carlo event generators have been the subject of great theoretical and practical 
developments, most significantly in the extension of existing parton shower simulations to consistently 



include exact next- to-leading order (NLO) corrections 1 12- T5|[25||112 - 125 1 and, separately, in the 



consistent combination of parton shower simulations and high multiplicity tree-level matrix element 
generators 1 4 5] 65 70 126 - 129 1. The state-of-the-art in fixed order predictions has also undergone major 



improvement, resulting in fully differential Monte Carlo predictions at next-to-next-to-leading order 
(NNLO) for inclusive Higgs production and, simultaneously, NLO accuracy for production in association 
with a hard jet p30HT331. 

In this article we present predictions from most of the Monte Carlo simulations arising from this 
theoretical activity, for the gluon fusion process at the LHC, at the expectecj^ initial hadronic centre- 
of-mass energy, y/s = 10 TeV. As well as updating existing results, based on the originally forecast 
14 TeV centre-of-mass energy, this document represents a broad comparative study among a number of 
fundamentally different Monte Carlo approaches, hence it also serves to gauge their relative merits and 
gauge the stability of our theoretical predictions with respect to the various methods. 

In the following we shall concisely review the pertinent features of the simulations included in the study, 
prior to presenting results for a variety of simple observables concerning the gluon fusion production 
channel. At the end of the article we summarise the results and comment on the readiness of these 
theoretical tools for much anticipated Higgs analysis. 



11.2 MATRIX ELEMENTS AND PARTON SHOWERS 

In modern experimental particle physics, shower Monte Carlo (SMC) programs have become an indis- 
pensable tool for data analysis. From a user perspective, these programs provide an approximate but 
extremely detailed description of the final state in a high-energy process involving hadrons. They provide 
Si fully exclusive description of the reaction, as opposed to fixed-order QCD calculations, which are only 
suitable for the computation of inclusive quantities. 

After a latency period, research in SMC is once again very active, with significant advances being made 
in the last decade. In general these develo pments can be grouped into two main classes: 
"Contributed by: M. Grazzini, K. Hamilton, S. Hoche, F. Maltoni, C. Oleari, S. de Visscher, J. Winter 
^"At the time the study was started. The current plan is for running at 7 TeV and 14 TeV, which leaves this study nicely in the 
mod range. 
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1. The merging of leading-order matrix elements (ME), characterized by a high number of final- 
state partons, with parton showers (PS). Examples of such methods are the CKKW matching 
scheme ||4] |126 |, the MLM matching procedure fTO} as well as the newer merging schemes based 
on truncated parton showers |65 129]. 

2. The interfacing of NLO calculations (that are typically available only with a small number of legs 
in the final state) with parton shower simulations (MC@NLO [12| and POWHEG y_4])- 

All of them have to face the same problems: avoiding overcounting of events as well as the presence of 
dead regions. 



11.3 MONTE CARLO PROGRAMS FOR THE STUDY 

With the exception of the Hnnlo program all of the Monte Carlo programs used in this study fall into 
one or other of the two classes described above, implementing some form of matching/merging between 
fixed order calculations and parton shower simulations. In order to have a more full comparison of the 
available Monte Carlo tools, we also include Hnnlo, which is based on a fixed order NNLO calculation 
of Higgs-boson production via gluon fusion. 



11.31 HNNLO 

The HNNLO program is based on an extension of the NLO subtraction formalism to NNLO, as described 
inref. | fT32l . 

The calculation is organized in two parts. In the first part, the contribution of the regularized virtual 
corrections is computed up to two-loop order. In the second part, the cross section for the production 
of the Higgs boson in association with one jet is first evaluated up to NLO, i.e. up to 0(af )), using 
conventional NLO subtraction methods. Now, since the H + jet cross section is divergent when the 
transverse momentum, q^^, of the Higgs boson becomes small, a further counterterm must be subtracted 
to make the result finite as — ^ 0. To this end the program uses counterterm introduced in ref. |132| . 
Having regularized the real and virtual parts, the two contributions can be combined to reconstruct the 
full cross section. Organizing the differential cross section in this way, one can construct a parton-level 
event generator with which arbitrary infrared safe quantities can be computed. The present version of the 
program includes the decay modes — )• 77 |[l32j, H — )• WW — Ivlv and H — ZZ — )• 4 leptons 1 133) . 

The calculation is performed in the large top-mass approximation. This is known to be a good 
approximation provided that the Higgs boson is not too heavy and the transverse momenta of the final 
state jets are not too large. 



11.32 MadGraph / MadEvent 

Besides the possibility of generating processes in a long and extensible list of theoretical models (SM, 
MSSM, Higgs effective theory ...), the Monte-Carlo generator MadGraph/MadEvent (MG/ME) |81] is 
also intended to simulate accurately the QCD radiation from initial and final states when coupled to a 
parton shower simulation. Such an aim can be achieved by using jet matching and a phase-space slicing 
technique 

In MG/ME, three jet matching schemes (using Pythia 6.4 |T8|) are implemented, namely the MLM (70), 



the fc^-MLM | |134[|135J and the shower- fc^ [135] schemes. While the first two methods work with both 
virtuality and pT-ordered showers, the third one only works with the pT-ordered showers. For each of 
these methods, no analytic Sudakov reweighting of the events is performed, instead showered events are 
rejected if they are not matched to the ME-level partons. A detailed comparison of the fc^-MLM and 
shower-fcx behaviours has shown that their respective outputs are very similar for the production of heavy 
colored particles in the SM and beyond | |135 1. In addition, a comparison for the production of 1^-f-jets 
events between the results from /c^-MLM and other simulation chains with different matching schemes 
has led to a similar conclusion [134[ . 
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Having the computation of the effective coupUng between a scalar or pseudo-scalar to the gluons, the 
accurate simulation of the production of a Higgs boson accompanied by initial and final-state radiation is 
therefore relatively straightforward. In order to compare the MG/ME production with the results from the 
other generators considered in this work, we choose to simulate H + 0,1,2 partons at the ME level and 
apply the k±-MLM scheme with = 10 GeV and Qmatch = 15 GeV. These choices were made in 
agreement with the smoothness of ME— )>PS transition regions in the 2 — )• 1 and 1 — )• differential jet 
rates distributions. 



11.33 MC@NLO 

The MC@NLO method fl^ was the first one to give an acceptable solution to the overcounting problem. 
The generality of the method has bee n exp licitly proven by it s application to processes of increasing 
complexity, such as heavy -flavour-pair 1 112| and single-top 1 113) production]^ The basic idea of MC@NLO 



is that of avoiding the overcounting by subtracting from the exact NLO cross section its approximation, 
as implemented in the SMC program to which the NLO computation is matched. Such approximated 
cross section (which is the sum of what have been denoted in [ 12] as MC subtraction terms) is computed 
analytically, and is SMC dependent. On the other hand, the MC subtraction terms are process-independent, 
and thus, for a given SMC, can be computed once and for all. In the current version of the MC@NLO code, 
the MC subtraction terms have been computed for Herwig 6 f 136], but extensions to other SMC are 
possible. In general, the exact NLO cross section minus the MC subtraction terms does not need to be 
positive. Therefore MC@NLO can generate events with negative weights. For the processes implemented so 
far, negative- weighted events are typically about 10-15% of the total. Their presence does not imply a 
negative cross section, since at the end physical distributions must turn out to be positive, but affects the 
overall efficiency of the simulation. 

The features implemented in MC@NLO can be summarized as follows: 

- Infrared-safe observables have NLO accuracy. 

- CoUinear emissions are summed at the leading-logarithmic level. 

- The double logarithmic region (i.e. soft and collinear gluon emission) is treated correctly since 
Herwig 6 uses an angular-ordered shower. 



11.34 POWHEG 

The POWHEG (Positive Weight Hardest Emission Generator) method was proposed in ref. [14J . This 
method overcomes the problem of negative weighted events, and is not SMC specific. In the POWHEG 
method, the hardest radiation is generated first, with a technique that yields only positive-weighted events, 
using the exact NLO matrix elements. The POWHEG output can then be interfaced to any SMC program 
that is either -ordered, or allows the implementation of ap^- vetoj^ However, when interfacing POWHEG 
to angular-ordered SMC programs, the double-log accuracy of the SMC is not sufficient to guarantee the 
double-log accuracy of the whole result. Some extra soft radiation (technically called vetoed-truncated 
shower in ref. [T?]) must also be included in order to recover double-log accuracy. In fact, angular ordered 
SMC programs may generate soft radiation before generating the radiation with the largest pT, while 
POWHEG generates it first. When POWHEG is interfaced to shower programs that use transverse-momentum 
ordering, the double-log accuracy should be correctly retained if the SMC is double-log accurate. The 
ARIADNE program f97l, Pythia 6.4 [18] (when used with the new showering formalism), ADICIC++ [W] 
and the new parton showers based on the Catani-Seymour dipole formalism ||89„90 1 adopt transverse- 
momentum ordering, and aim to have accurate soft resummation approaches, in the limit of large number 
of colours. 

^' A complete list of processes implemented in McSNLO can be found at 

http : / / www . hep . phy . cam . ac . uk/theory/webber /MCatNLO . 
^^AU SMC programs compatible with the Les Houches Interface for User Processes |2| should comply with this requirement. 
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Up to now, it has successfully been applied to Z pair hadroproduction |117|, heavy-flavour pro- 
duction 1 1 18 1, e+e^ annihilation into hadrons 1 1 19 1 and into top pairs 1 125 1, Drell-Yan vector boson 
production 1 120 122 1, W' production 1 137 |, Higgs boson production via gluon fusion 1 121 123 1, Higgs 
boson production associated with a vector boson (Higgs-strahlung) 1 121 1, single-top production 1 124 1 
Z + 1 jet production | ,138J , and, very recently, Higgs production in vector boson fusion |139| . 



THE POWHEG BOX The POWHEG Box is an automated package able to construct a POWHEG imple- 
mentation of a NLO process, given the following ingredients: 

1. The list of all flavour structures of the Bom processes. 

2. The list of all the flavour structures of the real processes. 

3. The Born phase space. 

4. The Born squared amplitude, the color correlated and spin correlated Born amplitudes. These are 
common ingredients of NLO calculations regularized with a subtraction method. 

5. The real matrix elements squared for all relevant partonic processes. 

6. The finite part of the virtual corrections, computed in dimensional regularization or in dimensional 
reduction. 

7. The Born color structure in the large limit of the number of colors. 

The plots in this article were obtained using the POWHEG Box, and the completion of the shower has been 
done both with Pythia 6 and with Her wig 6. 



11.35 Herwig++ 

Herwig++ builds and improves upon the physics content of the parent Herwig 6 program, particularly in 
regard to the accurate simulation of QCD radiation. A major success of the original Herwig 6 program 
was in its modeling the effects of soft gluon interference, specifically the colour coherence phenomenon, 
whereby the intensity of soft gluon radiation, emitted at wide angles with respect to a bunch of collimated 
colour charges, is found to be proportional to the coherent sum of emissions from the constituents i.e. the 
jet parent 1 101[ 140- 148]. This effect is manifest in the perturbative series as large soft logarithms and is 
implemented as an angular ordering of successive emissions in the parton shower. 

A further significant accomplishment of the POWHEG pT] formalism is in fully catering for such effects 
through a careful decomposition of the angular-ordered parton shower into a truncated shower, describing 
soft wide angle radiation, the hardest emission, as described above, and a vetoed shower comprised of 
increasingly coUinear emissions. In doing so the formalism provides a means of distributing the highest 
Pt emission in an event according to the exact real-emission matrix element, including resummation 
effects, without degrading or otherwise disturbing the resummation and colour-coherence properties 
inherent to the parton shower. 

The facility to perform truncated showers is absent from the older fortran Herwig 6 program but is 
implemented in the new Herwig++ program, which also includes its own native, independent, POWHEG 



simulation for the gluon-fusion process, the results of which are presented in section 11.5 



11.36 Sherpa 

Sherpa is a multi-purpose Monte-Carlo event generation framework for colliders | 22p3) . The main goal of 
this project is a proper simulation of the perturbative aspects of the collision, although significant improve- 
ments have been made over the past years regai^ding the simulation of non-perturbative dynamics, like 
the process of hadronisation. One of the key features of the Sherpa program is a general implementation 
of a novel technique for combining tree-level matrix elements with parton showers, in arbitrary QCD or 
QCD-associated processes |65 1. To this end, Sherpa makes use of its two internal tree-level matrix element 
generators AMEGIC++ |149| and Comix |150) , which are capable of simulating both Standai^d Model 
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(AMEGIC++ and Comix) and beyond Standard Model (AMEGIC++) reactions with high-multiplicity final 
states. Soft and coUinear parton radiation is generated in Sherpa by means of a parton shower based on 



Catani-Seymour dipole factorisation 1 89 1. This formalism has apparent advantages compared to more 
conventional parton showers, which are based on strict 1 — 2 splittings and often lack the notion of a 
well-defined spectator parton. In the new approach, the recoil partner of a splitting parton is always a 
single external particle. This turned out to be an important ingredient when combining paiton-shower 
evolution with higher-order tree-level matrix elements. 

The technique for merging matrix elements and parton showers, which is employed by Sherpa, is based 
on phase-space slicing. A detailed description of the corresponding algorithm, and its relation with other 
tree-level merging techniques, can be found in [65]. The method has recently been applied to mixed 
QCD and electroweak processes, in particular photon and diphoton production 1 151) and proves to give a 



consistent and reliable description of data from deep-inelastic lepton-nucleon scattering 1 152| . Compared 



to the CKKW algorithm \4\ 126) , the new merging scheme of ref f65l is more sophisticated and improves 



over CKKW by including truncated vetoed parton showers. The results are more accurate, with respect to 
those of the CKKW approach, which has been employed in former versions of the Sherpa event generator 
with already great success | |134[|153f|155| |. 



11.4 Parameters for the study 

In the following, we present results for the pp LHC at a centre-of-mass energy of 10 TeV. The common 
features of the analysis ai^e the following: 

• Model: We work in the Standard Model in the large top-mass limit. In this approximation the 
couplings between the gluons and the Higgs boson are described by a dimension-five effective 
interaction ^ 

^h = -jgh G^iuG^H, (27) 

withg',, = as/{3iTv). 

• Event samples: The analysis was performed on the generated final state and, with the exception of 
HNNLO, after parton showering. Hadronization effects were included for the MC@NLO and POWHEG 
results only. 

A Higgs boson mass of mn = 120 GeV was assumed. Tree-level predictions were generated using 



the leading order CTEQ611 PDF set 1 156 1, while generators employing the POWHEG method used 
the next-to-leading order PDF set CTEQ6m \l56j . In both cases the parametrisation of the strong 
coupling was chosen accordingly. All partons (excluding the top quark) were taken to be massless 
and their Yukawa couplings were neglected. 
• Jet definitions: Jets were defined using the longitudinally invariant -algorithm with D = 0.7 in 
the implementation of FastJet |157|. They were required to lie within a rapidity range of |?7| < 4.5 
and have transverse momenta of px > 20 GeV. 

11.5 RESULTS 

In this section we present and discuss results obtained for some key observables in the analysis of 
Higgs production via gluon fusion. As already mentioned, we do not expect the LO matched results 
to provide reliable information on total rates, so we have normalized the corresponding curves, from 
MADGRAPH/MADEVENT and Sherpa, to the Hnnlo result. On the other hand, having NLO accuracy, 
Herwig++, MC@NLO and POWHEG have not been rescaled. 

In fig. [25} we plot the transverse momentum of the Higgs boson and, in fig.|26j its rapidity. The Higgs 
boson pt, in particular, is determined by the recoiling QCD radiation, both soft and hard, and, exactly 
as for Drell-Yan, it is therefore a key observable. The blow up of the small-p^ region (left panel of 



fig. 25 1 shows quite good agreement among the various MC approaches with predictions being typically 
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Fig. 25: The transverse momentum spectrum of the Higgs boson. Tree-level predictions have been 
rescaled by the global factors indicated in the legend. The lower panels display the ratio of individual 
results and the average of all histograms, excluding the results from HNNLO. 



peaked in the range between 5 and 10 GeV. The obvious excess in the Hnnlo prediction, at low-pT, is 
expected on the grounds that it is based on a fixed order computation, hence, it does not resum the effects 
of multiple soft emissions, which are essential for a proper description of the px = region. At higher 
values of the px, the agreement is also excellent, apart from MC@NLO which shows a steeper behaviour 
with respect to the results obtained by the POWHEG method and with the matching. As already pointed out 
in Refs [121 123 1, this is due to NNLO terms in the POWHEG formula. It is, however, important to note 
that, for all the NLO codes, this particular distribution can be predicted only at LO, i.e. no H + 2 partons 
contribution and no H -\- 1 parton one-loop contributions are included. From this point of view, it is 
reasonable to expect the shape to be sensitive to variations in the renormalization and factorization scales, 
although, in practice, this sensitivity is much milder due to the resummation of higher order corrections 
(i.e. the shower). In any case, it is both remarkable to see that the predictions based on the POWHEG 
method and the ME-nPS matching show such good agreement, particularly considering the fundamental 
differences in their methodology. 



In the rapidity distributions of fig. 26 the Hnnlo result shows all of its NNLO content: in fact, this is 
the only plot that receives contributions from the two-loop diagrams. 

Figure [27] shows the jet px distributions for the four hardest jets (ordered in px)- Once again the 
agreement among the various approaches is very good, with MC@NLO leading to significantly less events at 
very high pt's; this lower number of events is in exact correspondance with that seen for the Higgs boson 
transverse-momentum distribution and bears the same explanation. A particularly interesting feature is 
the agreement found on the 3rd and 4th jet spectra. Only Sherpa has included the corresponding tree-level 
hard matrix elements, while all other predictions contain only one (NLO codes) or two (Hnnlo and 
MadGraph/MadEvent) hard partons. This good agreement is a mere coincidence, since in POWHEG, 
MC@NLO and Herwig++ these extra jets come from the shower, and are therefore only corect in the strict 
collinear limit. Similar comments hold for the pseudorapidity distributions of fig. [28] 

Larger discrepancies are instead present for more exclusive quantities, such as the distance in the -q-cf) 
plane between the two leading jets, AR12, shown in fig. 29 Indeed, we start appreciating here some 
interesting differences in shape: Herwig++, for example, predicts much steeper distribution than Sherpa. 
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Fig. 26: The rapidity distribution of the Higgs boson. See fig. 25 for details. 



Finally, in fig. 30 we plot the jets rates, which, once again, agree within 50% uncertainty even for 



higher multiplicities. 
11.6 CONCLUSIONS 

We have reported on the first comparison among several different Monte Carlo approaches to the simulation 
of Higgs boson production in gluon fusion. Apart from some basic choices, such as the parton distribution 
functions, the collider energy and our choosing to use an effective theory where the top quark mass has 
been taken to infinity, no detailed tuning has been performed. The main idea being that of an "out-of- 
the-box" comparison among various codes, all of which represent the state-of-the-art in Monte Carlo 
tools. 

The upshot of our comparison is that, apart from the overall normalization, which is only reliable in 
NLO and NNLO codes, the various approaches give consistent results within their expected range of 
validity. 
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Fig. 27: The transverse momentum spectra of the first four hardest jets, ordered in pT, accompanying 



Higgs boson. See fig. 25 for details. 
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Fig. 28: The pseudorapidity distributions of the first four hardest jets, ordered in px, accompanying 



Higgs boson. See fig. 25 for details. 
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Fig. 29: The separation in the rj-ct) plane of the two leading jets accompanying the Higgs boson. See 



fig. 25 for details. 
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12. Wbb IN THE HIGH-pT HW REGION|f] 
12.1 INTRODUCTION 

The main aim of this section is to study the effect of next-to-leading order (NLO) QCD corrections to the 
pp — )• Wbb process, in the region of phase space which is relevant for the highly boosted WH analysis 



with H ^ bb \ 158 159 |.The details of the setup for the NLO computation in this kinematic region as 



well as results for the scale dependence of the total cross sections are presented in Section 12.2 The 
application of the NLO calculation to the experimental analysis on which the mentioned Higgs search is 
based is not straightforward, since the latter addresses the simulation of the backgrounds through the use 
of a parton shower Monte Carlo (MC) algorithm which is applied on top of the QCD leading order (LO) 
matrix element calculation. 

While for an increasing number of processes specific Monte Carlo generators have been made available 
to combine the NLO calculation with a parton-shower based MC generator, as in the MC @ NLO program p2) 
or in the new POWHEG method 1 15|l60| , no such generator is available for the pp — Wbb process yet. It is 



therefore useful to define a re-weighting procedure for pp — )■ Wbb events generated by the use of a parton 
shower Monte Carlo, which for this specific case is Herwig p7| , based on the distributions predicted by 
the NLO calculation. 

An important feature of the highly boosted WH analysis is the application of a tight jet veto. In the 



cut-based analysis 1 159 1 this veto is applied at px = 20 GeV, while in a more refined likelihood based 
analysis [161] the maximum px considered for a possible additional non 6-jet in the event is 60 GeV. The 
NLO QCD correction depends significantly on the px of an additional non 6-jet in the event and this is 
addressed in this study. This situation is analogous to other backgrounds for Higgs boson signals, as is 
the case of the ttbb process as a background for the search for tiH with H — bb, also considered in the 
present proceedings. 

In the specific case of the qq' — Wbb process, the real corrections which enter the NLO computation 
include an additional gluon-induced process, q{q) g — Wbbj, which, given the gluon luminosity at the 
LHC, makes the NLO correction very large. However, by constraining the transverse momentum of the 
extra non-b-jet through a jet veto one can reduce the impact of this extra channel, which then improves 
the scale dependence of the NLO computation, as long as such extra kinematical cut is not too restrictive. 

12.2 Wbb PRODUCTION AT NLO QCD WITH A TIGHT JET VETO 

The calculation of the total and differential cross sections for W^bb production at NLO QCD for this study 



is based on Refs. 1 162 - 164 1, which includes full 6-quark mass effects. The input parameters are chosen 
as follows: rrih = 4.79 GeV, nit = 173.1 GeV, Mw = 80.399 GeV, = 0.223, Gp = 1.16639 • 10"^ 
(the weak coupling being defined as: gw = {SM^^Gp/ V^)^^^'^)- The contribution from the third 
generation quarks in the initial state is neglected and the non-zero CKM matrix elements are chosen to be: 
Vud = Vcs = 0.974 and Vus = Vcd = 0.227. 

Parton-level LO results (calculated as a cross-check) are obtained using the one-loop evolution of 
and the CTEQ6L1 f82l set of PDFs with as(M^) = 0.130, while parton level NLO results are obtained 
using the two-loop evolution of and the CTEQ6M set of PDFS with as{Mz) = 0.118. 

RenoiTnalization (/i, ) and factorization (/ij) scales are set to be equal and varied by a factor of two 
around a central value /xq = [My^ + (p^)'^ + (Pt)'^]^^'^' where p^ and p^ are the transverse momenta of 
the jets generated by b and b respectively. 

The jets are constructed using the kx jet algorithm with pseudo-cone size R = 0.3 and the parton 
momenta are recombined within a jet using the so called £'-invariant scheme (=sum of momenta of the 
constituents). We study two samples of events, i.e. 

• events with just 2 6 jets; 



Contributed by: F. Febres Cordero, G. Piacquadio, L. Reina, D. Wackeroth 
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• events with 2 6 jets and 1 non-fe jet; 

where b and non-6 jets are identified imposing the following cuts: 

• > 30GeV, |r/^| < 2.5; 

• > 200 GeV, \rj^\ < 2.5; 

• 15 GeV < < 60 GeV, < 5; 

• p^^ > 200 GeV; 

• Rbb < 1-2 . 

p^ and jry^l denote the transverse momentum and pseudorapidity of either one of the two 6 jets, and 
\ri^\ are the transverse momentum and pseudorapidity of the W boson, pVj?'"'^^ and |ry"o"^''| denote the 
transverse momentum and pseudorapidity of the non-6 jet, R^i is the relative separation between the 6 and 
6 jets, and p^ the transverse momentum of the 66 2-jet system. 

The renormalization and factorization scale dependence of the total cross sections for W^bb production 
at the LHC {-s/s = 14 TeV) using this setup is studied in Table|5]and in Fig. 31 The NLO exclusive cross 



section corresponds to the 2 b-jet only sample, while the inclusive one corresponds to the sum of both 2 b 



jet only and 2 b jets and 1 non-b jet samples. Thus, in Fig. 31 the curve labeled as NLO Inc is the sum of 
the curves labeled as NLO Exc and 26 + j only. 

Table 5: LO, NLO inclusive, and NLO exclusive cross sections (in fb) for W^bb and W~bb production 
at y/s = 14 TeV. Listed separately are also the cross sections (in fb) for the 1^66 + j channel alone. The 
central values correspond to /^^ = /U/ = /^o> while the upper and lower bounds represent the maximal 
upper and lower variation obtained when varying /i^ = /^/ between ij.o/2 and 2/io- 



Process 


LO 


NLO inclusive 


NLO exclusive 


Wbb + j 


Ty+66 
Ty-66 


7(5+17 

"3-13 


155-, 
90t? 


40+iY 
26t?7 





It is interesting to note that, contrary to the findings in Refs. 1 162 - 164 1, in this kinematic regime 
the inclusive cross section exhibits less scale dependence than the exclusive one. Different factors may 
contribute to this different behavior, but mainly the kinematic cuts used in the present analysis make the 
jet veto much more selective and enhance the effect of unbalanced scale dependent logarithms in the 
exclusive cross section, while their effect tends to compensate between Wbb and Wbb + j production in 
the inclusive case. 



12.3 COMPARISON OF FIXED-ORDER AND PARTON-SHOWER RESULTS 

While the parameters used for the fixed-order LO and NLO computation were already presented in the 
previous section, the parameters used for the matrix element computation pp — )• Wbb which serves as an 
input to the parton-shower algorithm are listed in the following: 

• LO parton density functions CTEQ6L1; 

• strong coupling constant as{Mz) = 0.130, consistent with the PDF set used; 

• QCD renormalization and factorization scales both equal and set to Q = \J M"^ + p^ ^y; 

• one-loop running of a^. 

The LO matrix element computation and initial three-body phase integration is based on the AcerMC 
generator [ 1 65J (v. 3.5). This is then passed as an external process to Herwig (v. 6.510), which applies 
the parton shower algorithm and produces the final state partons which are used as an input to an 
inclusive Ut jet clustering algorithm with parameter R = 0.3 and with the £^-invariant parton momenta 
recombination scheme. The underlying event has been switched off during generation, while the effect of 
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hadronization has been removed by applying the jet clustering directly on top of the final state partons 



before hadronization. As opposed to the Wbb sample generated for the analysis presented in 1 159 1, the 
one-loop running of as was used instead of the two-loop running: this increased the LO cross section by 
« 30 % with respect to the value used in the mentioned analysis. 

The parton-level jets produced in this way can be compared to the fixed-order calculation described in 



Section 12.2 and will effectively include the approximative leading logarithmic resummation introduced 
by the parton shower algorithm. This involves a comparison of N produced jets with respectively the two 
or three leading jets of the LO or NLO fixed-order computation. The two 6-jets are selected out of the N 
jets produced by the parton shower algorithm by matching them to the two ^-quarks after radiation as 
traced in the Monte Carlo history: 

Ai?(jet, b - quark) < 0.4. 

In the case where more than one jet is closer to the 6-quark than AR = 0.4, the 5-quark is associated to 
the closest jet. As a cross-check of the eventual systematic uncertainty introduced by this choice, the 
same procedure is applied using the 6-quarks before radiation, which are produced directly by the matrix 
element computation, and the difference in the results is found to be negligible. Finally, the leading 
additional non 6-jet in the event is defined as the jet out of the jets produced by the parton shower which, 
after excluding the two jets matched to the fe-quarks, is the highest in transverse momentum. Events where 
only a single jet is matched to a 6-quark are thrown away; this happens typically when the two 6-quarks 
are closer than AR ^ 0.3 and are therefore included into a single jet. After the jets are defined, the 
kinematic cuts defined in the previous section are applied: in the case of the cuts applied on the W boson, 
its momentum is considered after the effect of reshuffling of momenta operated by the parton shower 
algorithm. The distributions based on the jets obtained by using the parton shower result are labeled as 
LO-i-PS in the next section. 

As a cross-check, also the result of the LO matrix element computation used as input to the parton 
shower algorithm is considered in the following and is labeled as LO only. In this case the two 6-quarks 
partons resulting from the ME computation are used directly for the computation of the observables 
and for the application of the cuts, except for the cut on the additional non 6-jet in the event, which is 
not applied. Both results are compared to the LO and NLO inclusive theory predictions obtained in the 
previous section, which are labeled as LO (theory) and NLO (theory). Note that NLO (theory) includes 
both the qq' initiated and qg + g^^-initiated processes. For comparison with the LO-hPS result which is 
based on the qq' — Wbb matrix element only, we also include the NLO QCD prediction for the total 
cross section of the g^'-initiated process separately, denoted as NLO(gg') (theory). 

12.4 RESULTS 

The cross sections are listed in Table[6] The two LO predictions agree fairly well, even if the comparison is 
slightly biased by a marginally different choice of the renormalization and factorization scale between the 
two computations. Once the parton shower algorithm is run and the kinematic cuts are applied, including 
the additional jet veto, the cross section turns out to be 30% lower with respect to the LO prediction. 
The complete NLO theory prediction, on the other side, predicts a cross section which is 14% higher 
than the same LO prediction. As a result, the parton shower prediction (LO-hPS) must be rescaled by 
an inclusive K factor of 1.64 in order to normalize it to the complete NLO computation. If only the qq' 
initiated process is included in the NLO prediction, the inclusive K factor only amounts to 1.16. The 
uncertainty on the complete NLO cross section estimated through variations of the factorization and 
renormalization scales by factors 1/2 and 2 is 5%. The impact of PDF uncertainties has not been 
considered here. 

The impact of the NLO corrections on the shapes of the distributions of most interest for the analysis 
are shown in the following, separately for W^bb and W~bb, in Figs. 32pT The crucial observable is the 



differential ratio between the NLO and the LO-i-PS predictions for the variable of interest: if this ratio is 
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Subprocess 


LO 


LO (theory) 


LO+PS 


NLO (theory) 


NLO(gg') (theory) 


W+bb 


144 


138^^^ 


97 






W-bb 


79 




52 


90+^ 




Sum 


223 


214+^^ 


149 


245lt^ 


173^^1 



Table 6: Central value of cross sections (in fb) obtained for the pp — )• Wbb process, comparing the 
five different methods. Details about the calculation of LO (theory) and NLO (theory) can be found in 
Section|12.2 



approximately flat, then it is possible to re-weight the Monte Carlo events produced by the parton shower 
algorithm by a simple inclusive K factor. 

Also the NLO to LO+PS ratios for the distributions for the pseudorapidity of the leading (Fig. [33] top) 



and subleading (Fig. 33 bottom) 5-jets are relatively flat, with the NLO computation predicting the 6-jets 
to be slightly more central. On the contrary the W bosons are significantly less central (Fig. 34 1 in the 
NLO prediction, and, as a consequence, a significant correction affects the distribution of difference in 
pseudorapidity between the bb and W boson systems, in particular at large pseudorapidities (Fig. 35 1. 

For the transverse momentum of the leading 6-jet (Fig. [32j top), apart from some threshold effect at 
values close to 100 GeV, the ratio NLO to LO+PS is relatively flat. For the transverse momentum of the 
subleading 6-jet (Fig. 32 bottom), the paiton shower approximation does a perfect job and the ratio is 
perfectly flat. 



A bit problematic is the case of the transverse momentum of the W boson system (Fig. 36 1 and, to a 
less extent, of the bb system (Fig. 37 ), where the NLO prediction is reliable towards higher transverse 
momenta, while, close to the threshold, it starts to be sensitive to the emission of multiple soft radiation 
(large logarithms). This is more severe for the distribution of the transverse momentum of the W boson 
system. In both cases, the LO+PS distribution could be re-weighted at large transverse momenta according 
to the NLO prediction, while the LO+PS based distribution itself could be used to extrapolate to the region 
close to the threshold. Correlations between the two distributions need clearly to be taken into account. 

The parton shower algorithm is able to reproduce fairly well the shape of the distributions of the bb 
invariant mass (Fig. 38 1, of the bbW invariant mass (Fig. 39 1 and of the distance in pseudorapidity and 
azimuthal angle of the two b-jets (Fig. 40). In the case of the bb invariant mass, a small discrepancy is 
seen at very small invariant masses, again in the region where eventually large logarithmic corrections to 
the NLO computation can be expected. 



Finally, the pT of additional non 6-jet distribution is shown in Fig. 41 The cross section corresponding 
to the events where no additional non 6-jet above 15 GeV is found is condensed in the first bin of the 
distribution. As a general tendency, the NLO computation predicts significantly more radiation than the 
LO+PS prediction. There are two possible reasons for this: 

1 . The parton shower algorithm used in this study acts on top of the LO matrix element producing 
Wbb, which means that it misses all radiation where the first splitting is of the type g ^ gg and the 
bb pair is produced later in the cascade, while the NLO prediction will include the case where the 
bb is produced right after the first g ^ gg splitting. 

2. The NLO computation includes the process q{q)g — )• Wbbj in the real correction term, which is 
strongly but not completely suppressed for small pT of the additional non 6-jet in the event. This is 
illustrated in Fig.|4T] where we also show the theory prediction when only including the g^-initiated 
process. 

Given this overall tendency, one would expect that by moving the jet veto cut at parton level from 60 GeV 
to 20 GeV (as in the cut based analysis), the K factor correcting from LO+PS to the NLO prediction 
would move significantly from 1.6 more towards one, which would be a clear advantage for the WH 
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analysis. However, this requires a reliable prediction of the transverse momentum of the additional non 
6-jet of the event down to very low transverse momenta (if compared with the hard scale of the event 
which is around pribb) 200 GeV). This is not provided by the NLO prediction, as the renormalization 
and factorization scale variation uncertainty of the exclusive 2 6-jet result reported in Table [5]of the last 
section demonstrates. In order to get a more reliable prediction in this kinematic region, some matching 
procedure between the parton shower and the NLO predictions is needed. Before being able to define such 
a matching procedure, further studies will be needed, based on parton shower Monte Carlo algorithms 
able to include the primary g ^ gg splittings (available in plain Herwig by using the pp — Wg matrix 
element), which would lead to a complete leading logarithmic prediction, or able to include the additional 
q{q)g — )■ Wbhj process, as Alpgen [64] (which however does not include the primary g ^ gg splittings). 
The use of these two algorithms will in fact allow to disentangle the two previously enumerated effects. 

12.5 CONCLUSIONS 

We have compared the NLO computation for the Wbb process with the predictions of the Herwig parton 
shower algorithm applied on top of the LO Wbb matrix element, considering for the first time the 
specific kinematic region explored by the high pT WH analysis and the effect of an additional non 6-jet 
veto (applied at parton level) of 60 GeV. Given the choice of PDFs, of strong coupling constant and 
renormalization and factorization scales already mentioned, we find that the cross section predicted by the 
NLO computation is a factor « 1.6 higher than the parton shower prediction. Many of the differential 
distributions predicted by the parton shower algorithm agree reasonably well with the NLO computation. 
In general, the increase in event yield due to the NLO correction can be decreased by further lowering the 
jet veto: however, the NLO prediction suffers from substantial uncertainties in the kinematic region where 
radiation significantly softer than the scale of the process occurs and further studies are ongoing to make 
the prediction more reliable. 
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Fig. 31: Renormalization and factorization scale dependence for the parton level W'^bb (upper plot) 
and W^bb (lower plot) LO (black, solid), NLO inclusive (red, solid), NLO exclusive (blue, solid) and 
Wbb + j (blue, dashed) production cross sections (cross sections are in pb). See Section 12.2 for details. 
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Fig. 32: Transverse momentum distribution of the leading and sub-leading b jet, for both W^bb and 
bb production. The lower window shows the ratio of NLO to LO+PS predictions (full line) together 
with the ratio between the two LO predictions (dashed line). 
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Fig. 33: Pseudorapidity distribution of the leading and sub-leading b jet, for both W^bb and W^bb 
production. The lower window shows the ratio of NLO to LO+PS predictions (full line) together with the 
ratio between the two LO predictions (dashed line). 
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Fig. 34: Pseudorapidity distribution of the W vector boson for both W~^bb and W~bb production. The 
lower window shows the ratio of NLO to LO+PS predictions (full line) together with the ratio between 
the two LO predictions (dashed line). 
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Fig. 35: Pseudorapidity difference distribution of the 2 6-jet system and W boson for both W~^bb and 
W~ bb production. The lower window shows the ratio of NLO to LO+PS predictions (full line) together 
with the ratio between the two LO predictions (dashed line). 
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Fig. 36: Transverse momentum distribution of the W vector boson for both W~^bb and W~bb production. 
The lower window shows the ratio of NLO to LO+PS predictions (full line) together with the ratio between 
the two LO predictions (dashed line). 
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Fig. 37: Transverse momentum distribution of the 2 6-jet system for both W~^bb and W~bb production. 
The lower window shows the ratio of NLO to LO+PS predictions (full line) together with the ratio between 
the two LO predictions (dashed line). 
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Fig. 38: Invariant mass distribution of the 2 6-jet system for both W'^bb and W~bb production. The 
lower window shows the ratio of NLO to LO+PS predictions (full line), together with the ratio between 
the two LO predictions (dashed line). 
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Fig. 39: Invariant mass distribution of the 2 6-jet plus W system for both W'^bb and W^bb production. 
The lower window shows the ratio of NLO to LO+PS predictions (full line), together with the ratio 
between the two LO predictions (dashed line). 
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Fig. 40: Relative distance distribution of the 2 6-jet system for both W~^bb and W~bb production. The 
lower window shows the ratio of NLO to LO+PS predictions (full line), together with the ratio between 
the two LO predictions (dashed line). 




Fig. 41 : Transverse momentum distribution of the non 6-jet for both bb + j and W~bb+j production. 
The lower window shows the ratio of LO to LO+PS prediction (dashed line including only the qcf — )• Wbbj 
process in the LO computation, full line including also the q{q)g — )• Wbbj process). Although the cross 
section for Wbb + j production is a LO result, it is calculated as part of the real corrections with the 
CTEQ6M set of PDFs and two-loop evolution of Og. 
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Part IV 

OBSERVABLES AND DETECTORS 



13. DELPHES, A FRAMEWORK FOR FAST SIMULATION OF A GENERIC COLLIDER 
EXPERIMENlE3 

13.1 INTRODUCTION 

Multipurpose detectors at high energy colliders are very complex systems. Their simulation is in general 
performed by means of the Geant | 166) package and final observables used for analyses usually require 



sophisticated reconstruction algorithms. This complexity is handled by large collaborations, and data and 
the expertise on reconstruction and simulation software are only available to their members. Precise data 
analyses require a full detector simulation, including transport of the primary and secondary particles 
through the detector material accounting for the various detector inefficiencies, the dead material, the 
imperfections and the geometrical details. Such a simulation is very complicated, technical and requires a 
large CPU power. On the other hand, phenomenological studies, looking for the observability of given 
signals, may require only fast but realistic estimates of the expected signal signatures and their associated 
backgrounds. 

The Delphes | 167) framework has been designed for the fast simulation of a general-purpose collider 



experiment. Using this framework, observables such as cross-sections and efficiencies after event 
selection can be estimated for specific reactions. Starting from the output of event generators (e.g. Les 
Houches Event File or HepMC), the simulation of the detector response takes into account the subdetector 
resolutions, by smearing the kinematics of final-state particles. 

Delphes includes the most crucial experimental features, such as: the geometry of both central and 
forward detectors, the effect of magnetic field on tracks, the reconstruction of photons, leptons, jets, 6-jets, 
r-jets and missing transverse energy, a lepton isolation, a trigger emulation and an event display. Several 
common datafile formats can be used as input in DELPHES, in order to process events from many different 
generators. 



13.2 SIMULATION OF THE DETECTOR RESPONSE 

The overall layout of the multipurpose detector simulated by DELPHES consists in a central tracking 
system surrounded by an electromagnetic and a hadron calorimeters (ECAL and HCAL, each with a central 
region and two endcaps) and two forward calorimeters (ECAL, with separate electromagnetic and hadronic 
sections). Finally, a muon system encloses the central detector volume. In addition, possible very forward 
detectors are also simulated, like zero-degree calorimeters (ZDC) and very forward taggers. Even if 
Delphes has been developed for the simulation of general-purpose detectors at the LHC (namely, CMS 
and Atlas), it allows a flexible parametiisation for other cases, e.g. at future linear colliders. 



Magnetic field, tracks and calorimetric cells 

The effects of a central solenoidal magnetic field are simulated. Every stable charged particle with a 
transverse momentum above some threshold and lying inside the tracker has a probability to yield a 
reconstructed track. The calorimeters are segmented into cells in the 0), with variable sizes. The 
response of calorimeters to energy deposits of incoming particles depends on their resolution and on the 
nature of the particles. The assumed calorimeter characteristics are not isotropic, with typically finer 
energy resolution and granularity in the central region |168[|169| . Their resolution is parametrised through 
a Gaussian smearing of the accumulated cell energy. 

^■^Contributed by: S. Ovyn, X. Rouby 
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Electrons and photons leave their energy in the electromagnetic parts of the calorimeters (ECAL and 
FCAL, e.m.) only, while charged and neutral final-state hadrons interact with the hadronic parts (HCAL 
and FCAL, had.). Some long-living particles, such as the and A's, with lifetime cr smaller than 
10 mm are considered as stable particles by the generators although they may decay before reaching the 
calorimeters. The energy smearing of such particles is therefore performed using the expected fraction of 
the energy, determined according to their decay products, that would be deposited into the ECAL and the 
HCAL. Muons, neutrinos and hypothetical neutralinos are assumed not to interact with the calorimeters. 
At last, the muon systems provide a measurement of the true muon kinematics, with a smeared transverse 
momentum. 



Forward detectors 

Most of the recent experiments in beam colliders have additional instrumentation along the beamline. 
These extend the r/ coverage to higher values, for the detection of very forward final-state particles. In 
Delphes, zero-degree calorimeters, roman pots and forward taggers have been implemented, similarly to 



the plans for Cms and Atlas collaborations |168[[T69J . 



The ZDCs allow the measurement of stable neutral particles (7 and n) coming from the interaction 
point (IP), with large pseudorapidities. The trajectory of the neutrals observed in the ZDCs is a straight 
line, while charged particles are deflected away from their acceptance window by the powerful magnets 
located in front of them. In their implementation in DELPHES, the ZDCs provide a measurement of the 
time-of-flight of the particle, from the IP to the detector location. The simulated ZDCs are composed of an 
electromagnetic and a hadronic sections, for the measurement of photons and neutrons, respectively. The 
ZDC hits do not enter in the calorimeter cell list used for reconstruction of jets and missing transverse 
energy. The fact that additional charged particles may enter the ZDC acceptance is neglected in the current 
versions of DELPHES. 

Forward taggers, located very far away from the IP, are meant for the measurement of scattered particles 
following very closely the beam path. To be able to reach these detectors, particles must have a charge 
identical to the beam particles and a momentum very close to the nominal value of the beam. These taggers 
are near-beam detectors located a few millimetres from the true beam trajectory and this distance defines 



their acceptance. This fast simulation uses the HECTOR software 1 170 1, which includes the chromaticity 
effects and the geometrical aperture of the beamline elements of any arbitrary collider Forward taggers 
are able to measure the hit positions and angles in the transverse plane at the location of the detector, 
as well as the time-of-flight. Out of these, the particle energy and the momentum transfer it underwent 
during the interaction can be reconstructed at the analysis level. 



13.3 HIGH-LEVEL OBJECT RECONSTRUCTION 

The results of the detector simulation are output in a file storing for each event its tracks, calorimetric cells 
and hits in the very forward detectors. In addition, collections of final particles (e^, fi^, 7) and objects 
(light jets, fe-jets, r-jets, E}^^^^) are provided. While electrons, muons and photons are easily identified, 
other quantities are more difficult to evaluate as they rely on sophisticated algorithms (e.g. jets or missing 
energy). For most of these objects, their four-momentum and related quantities are directly accessible in 
DELPHES output (E, p, pT, 7] and 0). 



Identification of electrons, photons and muons 

Real electron (e^) and photon candidates are associated to the final-state collections if they fall into the 
acceptance of the tracking system and have a transverse momentum above some threshold. Assuming a 
good measurement of the track parameters in the real experiment, the electron energy can be reasonably 
recovered. DELPHES assumes a perfect algorithm for clustering and bremsstrahlung recovery. Electron 
energy is smeared according to the resolution of the calorimetric cell where it points to, but independently 
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from any other deposited energy is this cell. Electrons and photons may create a candidate in the jet 
collection. 

Generator-level muons (/i^) entering the muon system acceptance and overpassing some threshold 
are considered as good candidates for analyses. The application of the detector resolution on the muon 
momentum depends on a Gaussian smearing of the px- Neither t] nor (p variables are modified beyond the 
calorimeters: no additional magnetic field is applied. Multiple scattering is neglected. This implies that 
low energy muons have in Delphes a better resolution than in a real detector 

To improve the quality of the contents of the charged lepton collections, isolation criteria can be applied. 
This requires that electron or muon candidates are isolated in the detector from any other particle, within a 
given cone in the {r],(p) plane. The isolation algorithm is based on tracker data, but in addition the sum of 
the transverse momenta of all tracks but the lepton one within the isolation cone is provided, as well an 
estimate based on calorimeter data. 



Jet reconstruction 

A realistic analysis requires a correct treatment of partons which have hadronised. Therefore, the most 
widely currently used jet algorithms have been integrated into the DELPHES framework using the FastJet 
tool 1 171[ . For all jet algorithms, the calorimetric cells are used as inputs. Since several particles can 
leave their energy into a given calorimetric cell, which broadens the jet energy resolution, a jet energy 
flow algorithm can be switched on in DELPHES. This takes into account the measured properties of tracks 
pointing to the calorimetric cells of interest for the jet reconstruction. 

A jet is tagged as a 6-jet if its direction lies in the acceptance of the tracker and if it is associated to a 
parent 6-quark. This identification procedure for the 6-tag is based on the true generator-level identity of 
the most energetic parton within a cone around the (r/, 4>) region, with a radius equal to the one used to 
reconstruct the jet. Jets originating from r-decays are identified using a procedure consistent with the one 
applied in a full detector simulation |^168J. The tagging relies on the two following properties of the r 
lepton. First, 77% of the r hadronic decays contain only one charged hadron associated to a few neutrals. 
Secondly, the particles arisen from the r lepton produce narrow jets in the calorimeter. 



Missing transverse energy 

The true missing transverse energy (MET), i.e. at generator-level, is calculated as the opposite of the vector 
sum of the transverse momenta of all visible particles. In a real experiment, calorimeters measure energy 
and not momentum. Any problem affecting the detector (dead channels, misalignment, noisy cells, cracks) 
worsens directly the measured missing transverse energy. In DELPHES, MET is based on the calorimetric 
cells only. Muons and neutrinos are therefore not taken into account for its evaluation. However, as muon 
candidates, tracks and calorimetric cells are available in the output file, the missing transverse energy can 
always be reprocessed a posteriori with more specialised algorithms. 



Trigger emulation 

Most of the usual trigger algorithms select events containing leptons, jets, and MET with an energy scale 
above some threshold. A trigger emulation is included in DELPHES, using a fully parametrisable trigger 
table, also allowing logical combinations of several conditions on the final analysis data. 



13.4 ASSUMED SIMPLIFICATIONS FOR THE FAST SIMULATION 

DELPHES is a fast simulation aiming at providing quickly realistic observables. It relies on several 
hypotheses and simplifications that allows the use of less sophisticated algorithms than in real experiments, 
but aims at reaching the same detection, identification and reconstruction performances. Since the 
framework design relies on the expectations of the real detector performances, detector geometry is 
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idealised: being uniform, symmetric around the beam axis, and having no cracks nor dead material. 
Secondary interactions, multiple scatterings, photon conversion and bremsstrahlung are also neglected. 

No longitudinal segmentation is available in the simulated calorimeters. Delphes assumes that ECAL 
and HCAL have the same segmentations and that the detector is symmetric with respect to the ry = 
plane and around the beam axis. A particle entering a calorimetric cell deposes all its energy, even if it 
enters very close to its geometrical edge. Particles other than electrons (e^), photons (7), muons (/x^), 
neutrinos (fg, i^^ and Ut) and hypothetical neutralinos are simulated as hadrons for their interactions 
with the calorimeters. The simulation of stable particles beyond the Standard Model should therefore be 
handled with care. 

The electron, photon and muon collections contain only real candidates. For instance, the particles 
which might leak out of the calorimeters into the muon system (punch-through) are not considered as 
muon candidates in DELPHES. However, fake candidates can be added into the collections at the analysis 
level, when processing DELPHES output data. 

In current version of DELPHES, the displacement of secondary vertices is not simulated, in particular 
for 6-jets. Extra hits coming from the beam-gas events or secondary particles hitting the beampipe in 
front of the forward detectors are not taken into account. At last, real triggers are intrinsically based on 
reconstructed data with a worse resolution than final analysis data. In DELPHES, the same data are for 
trigger emulation and for final analyses. 



13.5 IMPROVEMENTS AND PERSPECTIVE 

As a consequence of the fruitful discussions and collaboration during and after the Les Houches workshop, 
several improvements have been identified for the developments in DELPHES. The 6-jet algorithm will be 
improved by adding a dependence of the tagging efficiencies with respect to the jet transverse momentum. 
In current versions of DELPHES, the probabilities for 6-tag or mistag are uniforms, which underestimates 
the actual performances of the real experiments. The r-jet collection will allow 3-prong r's. Local 
detector inefficiency maps might also be implemented, as well as an improved interface to HepMC that 
would allow an easy connection to tools like Rivet HSl. 



CONCLUSIONS 

Delphes is a framework offering fast simulation tools allowing to investigate quickly new models and 
check their signatures in a realistic detection environment. It offers an intermediate step between simplified 
parton-level analysis and extensive analyses using the full-simulation power in large collaborations. 
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14. EFFECT OF QED FSR ON MEASUREMENTS OF Z/7* AND W LEPTONIC FINAL 
STATES AT HADRON COLLIDERS E3 

14.1 Introduction 

Due to its simple quantum structure and clean experimental signature, quark-antiquark annihilation to 
lepton pairs (Drell-Yan), is one of the most important channels in hadron collider physics. In this paper 
we shall discuss some requirements that must be met, at the observable level, when using information 
obtained from the final-state leptons in this process. The emphasis on "observable level" is particularly 
important; if information about states that are not themselves directly observable, such as the Z boson or 
a "bare" lepton in a Monte Carlo event record, is used for correcting or calibrating the raw measurement, 
then the precision of the corrected/calibrated result will suffer from an intrinsic ambiguity which was not 
present in the raw measurement, i.e., the result will have been degraded. 

We discuss the cause and magnitude of this degradation and argue that the experimental corrections 
and calibrations can equally well be performed without using such information, hence preserving the full 
precision of the raw measurement in the calibrated result. 

To illustrate our conclusions and to aid future studies, we examine various possible observable defini- 
tions and present a collection of reference comparisons for the Tevatron processes pp — )• Z/7* — )• e+e~ 
and pp — )■ Z/7* — > . These same concerns apply to the equally important W signal, which we also 
consider 

14.2 Monte Carlo Truth 

Calculations of collider physics processes rely heavily on factorizations of the full transition amplitudes 
(squared) into smaller, more manageable, pieces. These factorizations are typically formally correct in 
limits in which one single Feynman diagram dominates over all others, in which case the system has a 
well-defined "semi-classical" history, represented by the single dominant diagram. The individual pieces 
can then be treated as approximately independent since the quantum mechanical interference between 
them can be neglected in this limit. 

Essentially, what event generators, such as Herwig, Pythia, and Sherpa, provide as intermediate particles 
in their event records ("Monte Carlo truth") are representations of such semi-classical histories. At the 
very least, these histories show how the generator approximated the process so far (which may, in turn, 
furnish important boundary conditions on subsequent generation steps). But even in the best case scenario, 
when the diagram represented by the event record really is the dominant one, it is still only exact when all 
other diagrams are zero, i.e., in the semi-classical limit. Therefore, any procedure (such as a correction 
applied to an experimental measurement) that depends on history-information is only well-defined in that 
limit. At the quantum level, it would suffer from an irremovable ambiguity which would set an ultimate 
limit to the precision that could be obtained with that procedure. 

Let us consider the Drell-Yan process in more detail. For this process, e.g., the event record of the 
Pythia generator will contain an intermediate Z boson. This is intended to tell the user that the lepton 
pair comes from an s-channel diagram (which in fact includes the full Z/7* interference), and it also lets 
Pythia's parton shower know that the invariant mass of the lepton pair should be preserved in the shower 
So far so good. If QED radiation was not an issue, this intermediate Z boson is identical to the sum of the 
final-state lepton pair, and it is therefore tempting to use this boson, even in the presence of QED effects, 
to define some kind of Monte-Carlo-truth / generator-level / QED-corrected / ... "Z/7*". However, when 
including QED effects, diagrams with photons emitted in the initial state will interfere with diagrams 
that have them emitted in the final state. Therefore, we cannot, at the amplitude-squared level, determine 
whether a given photon was initial- or final-state radiation, not even statistically. However, the two cases 
correspond to different semi-classical histories, with different "Z/7*" kinematics, and therefore any 

^^Contributed by: A. Buckley, G. Hesketh, F. Siegert, P. Skands, M. Vesterinen, T.R. Wyatt 
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semi -classical "Z/7*" definition would at least be ambiguous up to the size of the initial-final interference 
terms. 

Another manifestation of the same basic problem is to do with the identification of which leptons came 
from the Z/7* decay, in an event with multiple leptons. Again, a correction procedure that would depend 
on Monte Carlo truth for this identification would be fundamentally ambiguous at least up to interference 
terms between diagrams with different assignments. 

Finally, there is the issue of how to define a lepton. Especially electrons, which are close to massless, 
emit large amounts of near-coUinear photon bremsstrahlung. The final-state electron in a Monte Carlo 
event record is therefore not a uniquely defined object (it depends, for instance, on if/how electron mass 
effects are implemented in that Monte Carlo). As we argued above, the electron "before" QED radiation 
is also not well-defined, since the electron interferes with other particles. And in any case it would 
be a mere coincidence if either object bears much resemblance to what might be called an electron in 
an experimental setting (to be discussed further below). It is therefore also crucial to operate with an 
unambiguous and coUinear safe "lepton definition". 

What we shall argue below is that such ambiguities are in fact both present and large for precision 
measurements of current interest. It is therefore important to reiterate that the full experimental precision is 
only preserved if the observables and correction procedures are defined entirely in terms of the final-state 
leptons, with these in turn defined in a way that is colUnear safe against photon emission. 

14.3 Definitions 

In previous measurements of the kinematics of the Drell-Yan process, the reconstructed data have typically 
been corrected to the "Z/7*" boson level. This correction process can be broken into three steps: 

1. Correcting the measurement leptons to the level of particles entering the detector ("unfolding" the 
detector resolution and efficiency). 

2. Correcting from the particles entering the detector to the "Z/7*". 

3. Correcting from the measured phase space to Att acceptance, with no selection cuts except on the 
"Z/7*" mass. 

As outlined in the previous Section, Step 2 introduces ambiguities and dependencies on the model 
used to simulate Drell-Yan and FSR. Step 3 involves correcting for particles that were unmeasured, so 
explicitly depend on a model of those particles. We therefore propose that all measurements be made 
available after Step 1: corrected only for detector resolution and efficiency. Steps 2 and 3 may of 
course be performed in addition, subject to problems discussed above. 

In order to perfonn Step 1, we must define the "electron" and "muon" observables in tenns of the 
particles that enter the detector, accounting for the different detector response in each case. We then 
consider how to form the "Z/7*" from those electrons and muons, and briefly discuss missing energy and 
a "W" final state. We therefore propose these definitions: 

• In simulated events, we consider all generator level particles with cr > 10 mm as "stable" |[T| (i.e. 
could possibly be detected). Only the stable particles may be used to define an observable, and no 
checks should be placed on the internal generator origin of any particles (no history dependence). 

• Electrons: to mimic the response of a calorimeter, electrons and photons must be combined into 
an localized "EM cluster", and such clusters considered as particle level electrons. Here, this 
clustering is done with simple cones of Aii = v/(A^)2 + (A7/)2=0.2 around any electroij^ Thus 
any narrow angle QED FSR is combined back into the electron, which may also pick up some 
additional electromagnetic energy from the underlying event. Any wider angle FSR is "lost": i.e. 
not associated with the electron. 

^*This simple cone clustering is sufficient to reproduce observable electrons at the Tevatron experiments. For the LHC 
experiments, a different cone size or more advanced clustering algorithm (e.g. anti-fcr) may better reproduce the detector level 
observable. Ideally, the same particle level electron definition can be agreed upon for all LHC experiments. 
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• Muons: the stable muons (i.e. after FSR) can be considered directly, as would be measured in the 
tracking system of a detector. Thus, all FSR is lost, and the underlying event has no effect. 

• Missing transverse energy (MET): in data this is typically calculated by inverting the vector sum of 
calorimeter Et and muon pT, introducing many detector acceptance and efficiency effects which 
are difficult to reproduce at particle level. To initiate discussion, we define the MET simply from 
the vector sum of all neutrinos in the event, which is compared to the pT of only the the W neutrino 
in Fig. [42] 

• Z/j*: selections should follow the data analysis. Typically, this will mean considering all electrons 
or muons within a given \r]\ range, then forming opposite charge pairs that lie within the required 
di-lepton mass window (e.g. 65-1 15 GeV). In the (rare) cases where two possible pairs pass these 
selections, the method used to select the "best" should again follow the data analysis, such as the 
pair closest to the Z mass. 

• W: again, selections should follow data analysis. This will typically mean combining the MET 
with the highest px lepton, then placing some requirement on transverse mass. 
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Fig. 42: Comparing the pT of the neutrino from a W decay to the particle level MET (defined in the text). 

When defining the Z/7* and W observables, three further selections are typically applied in data 
analysis that have a less clear particle level analogue. Here we provide suggestions as the basis for future 
discussion: 

1. Lepton isolation: reproducing detector level isolation requirements at particle level is difficult, 
whereas determining the efficiency loss due to such requirements in data is generally rather straight- 
forward. So we suggest the measured data should be corrected for these efficiency losses, and no 
isolation conditions be imposed at particle level. 

2. Minimum lepton pT (and MET) requirements: such requirements should generally be implemented 
at the particle level also. However, if the effect is small it may be possible to use the simulation to 
extrapolate into the unmeasured region. 

3. Event vetoes: for example, a second lepton veto in W selection. Due to detector efficiency effects, 
such requirements are difficult to implement in a way that is consistent between detector and particle 
level, and should be treated with care. 

14.4 Implications 

To study the impact of the definitions outlined in the previous section, some Tevatron examples are used. 
Samples of pp — ;> Z/j*, with Z/7* — ;> e+e~ and Z/7* — )• //+fi/x~, are produced using Pythia 6.421 JlSj, 
using the Perugia 6 tune f62l with the CTEQ6L1 PDFs fl561. The samples are normalized to the same 
number of generated events. For comparison, we also consider a "generated Z/7*", reconstructed by 
searching the Pythia event record and explicitly taking the two leptons from the Z/7* decay, before FSR. 
Regardless of the definition used, the leptons used to make a Z/7* are required to have \7]\ < 1.7 and a 
dilepton mass between 65 and 115 GeV. 
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14.5 FSR Properties 



First, the properties of pliotons from FSR are considered. Figure 43 shows, for — )■ e~^e^ and 
Z/7* — /i+fi/i~, the rates at which FSR photons are emitted from the Z/j* decay products. Next, AR 
between FSR photons and the nearest Z/7* decay product. Then, the FSR photon energy and the energy 
of "unmeasured photons" is shown; in the case of muons, all photons are considered unmeasured; for 
electrons, photons outside the cone of 0.2 are considered unmeasured. Finally, the fraction of all photons 
within 1 77 1 < 2.5 (typical electromagnetic calorimeter coverage) in the final state which arise from FSR 
for different pT requirements. 




-0.5 0.5 1 1.5 2 2.5 3 3.5 

# FSR photons 



-Non-FSB 

■ Dielectton FSR 

■•Dimuon FSR 



^ Dielectron FSR 
3 Dimuon FSR 



Fig. 43: Left: the rate of FSR photons from the Z decay products; centre left: AR{'y, I); centre: FSR 
photon energy; centre right: energy of "unmeasured" FSR photons (see text); right: the fraction of all 
photons that arise from FSR, at different energies. 



It can be seen that the rate of FSR is significant. In Z/7* — )• e^^e^, the photons are typically close 
to the electron and will be added to the electron cluster, however (using the default Pythia settings) 
approximately 23% of Z/^* — )• e"^e~ events will still contain at least one lost FSR photon (compared to 
66% of Z/'y* — )• /i+B//" events). These photons are typically of very low energy, though approximately 
3% of Z/y* — e+e^ and 7% of Z/y* — ^u^B/^^ events contain an unmeasured FSR photon above 
5 GeV. Identifying these lost FSR photons is almost certainly experimentally impossible, and it also 
appears impossible at the level of stable particles (without looking into the generator event record), due to 
the presence of photons from other sources (primarily vr*^ decays). In the following sections, we consider 
how these effects result in differences between the generated Z/7*and the measured dilepton final state, 
for some important kinematic properties. 



14.6 Energy Scale and Resolution, and Total Cross Section 

The energy scale and resolution for both the electromagnetic calorimeter and tracking system are typically 
derived by fitting the shape and position of the Z/7* peak. Here, we assess the impact of QED FSR on 
the measured Z/y* line-shape, and hence possible effects on detector calibration. Figure 44 shows the 
generated Z/j* mass distribution, and that reconstructed using the stated electron and muon definitions. 

It can be seen that FSR has a drastic effect on the observable line-shape below the pole. Obviously 
calibrations should ensure the detector response reflects the observable leptons, and not "correct" them 
to the generated Z/y*, so using the lower side of the mass peak would clearly rely upon a model of 
FSR. This introduces model dependence which may complicate the extraction of model independent 
quantities from data analysis. However, the energy scale may possibly be determined with minimal model 
dependence by fitting only the peak position, which is mostly unaffected by FSR. Similarly, it may be 
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possible to determine lepton resolution in a model independent way from the higher edge of the mass 
peak, which is also mostly unaffected. 
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Fig. 44: Comparing the generated Z/j* to the observable (defined in the text). Left: the Z/7* mass; 
centre: the Z/7* px; right: the Z/7* ay. 



The effect of FSR on the Z/7* line-shape has another implication: the inclusive Drell-Yan cross section 
is typically measured for a given mass range, for example 65-1 15 GeV. Energy lost in FSR causes the 
dilepton mass to be below the "true" Z/'y* mass, which means events above the mass window may migrate 
in, and events in the window migrate out. In the Pythia samples considered, this causes a 0.9% net loss of 
dielectron events, and a 2.1% net loss of dimuon events for the stated mass window. 



14.7 Z/j* pt and ax 

We next consider the impact on the Z/j* pT and qt 1 172 173 1 distributions, important variables in 
measuring non-perturbative QCD form factors and in generator tuning. Figure 44 shows the observable 
Z/7* Pt and ax compared to the true Z/y* px and ax- For px, effects of up to 4% are seen in the 
dielectron channel, and up to 10% in the dimuon channel. Effects are around 2% for ax, where the shape 
effect is larger in the electron channel. For both px and ax, the effects are largest at low values, the crucial 
region for generator tuning and determining form factors. 



14.8 Recovering Published Measurements? 

Several Tevatron measurements of the Z/7* px have corrected back to the "generated" Z/y* , with full 
Ait acceptance 1 174- 176| . Similarly, it is common practice in theory papers to present results at the same 
level (see a recent example in Ref. [177]). We perform a quick study to determine if such measurements 
and calculations can be approximated using only the stable final state particles, rather than having to 
resort to looking into the generator event history. This is done by using a wider cone around the final 
state electrons, to attempt to re-sum more FSR and get closer to the leptons directly from the Z/7* decay. 
Figure 45 shows the effect of using a cone of radius 0.2, 0.5 and 1.0. While the cone size of 0.5 comes 
closer to the "true" Z/y* px in the low px region, it moves further away at high px- Increasing the cone 
size to 1.0 clearly distorts the spectrum, with too many non-FSR photons being added to each electron. 
Similar behaviour is observed for ax- 
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Fig. 45: The effect on the Zj^* pT (left) and ax (right) of changing the electron cluster size. 



This simple study shows the difficulty in reproducing the generated Z/j* from only the stable particles, 
and the precision to which such "boson level" results can for example be implemented in Rivet (see 
Section |5]] and used for generator tuning with Professor | |47| . 

Fully reproducing published Z/j* pT results may only be possible by mimicking the data treatment: 
applying a "FSR correction factor" to the spectrum reconstructed from the stable particles. Such treatment 
introduces similar ambiguities and model dependence as the original data analysis which corrected back 
to the "Z/7*". 



14.9 Combining Electron and Muon Channels 

Experimentally, combining electron and muon channel measurements is considered a simple way to 
increase the statistics used in a measurement, and benefit from the fact that many experimental systematics 
are uncorrelated between the two channels. However, we have shown that the electron and muon 
observables are fundamentally different in both the measured cross section and reconstructed kinematics 
are different in each channel. Further complications are introduced by the different acceptance of 
calorimeters and muon detectors. These factors degrade the value of a combination, for which the same 
observable must be extracted from each channel, and it is impossible to do this without relying on a model 
of FSR. 

If a combination must be done, a "minimally model dependent" method would be: 

1. Restrict electron and muon measurements to the same phase space. This will typically mean 
excluding some region from one of the channels, due to different coverage of muon detectors and 
calorimetry. 

2. Apply a theoretical factor to the muon channel measurement, to correct from the observable muons 
to an equivalent of the electron observable: in this study, stable muons + all photons and electrons 
in a 0.2 cone around each muon. This factor should be published along with the combination. 

In simulated events, there are then two options: apply the same theoretical factor to simulated 
Z/7* — /i+fi/i^ as used for the muon data, then combine; or, directly treat stable muons as electrons 
(clustering EM energy around each muon). Figure 46 compares the px and for the muon channel 
calculated in this direct way to the electron channel, and good agreement is seen. 
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Fig. 46: The Z/j* px (left) and ax (right) using the same treatment of electron and muon final states. 



14.10 Conclusion 

We advocate that all future experimental measurements and theoretical predictions of the Z/j* and W 
(and other leptonic final states) should be presented in terms of observables, and not extrapolated beyond 
the measured acceptance. To this end, we propose definitions of observable electrons, muons and the 
Z/j*, as well as missing transverse energy and the W boson, all at the level of particles entering a 
detector. These definitions mimic the quantities measured in a detector, and provide an unambiguous, 
model independent basis to compare data and theory. 

While it is essential to provide the data in terms of observables, it may still be desirable to derive further 
theoretical corrections for FSR and acceptance, for comparisons to previous results. We recommend such 
correction factors be provided in a table, rather than being applied to the data. Using this table, (the inverse 
of) such corrections could also be applied to a calculation which does not include FSR or acceptance 
requirements, to allow direct comparisons to the data while maintaining the separation of measurement 
and theory. 
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Party 

JETS AND JET SUBSTRUCTURE 

15. STATUS OF JET SUBSTRUCTURE STUDIES 
15.1 INTRODUCTION 

In the 2007 Les Houches workshop [T] there was quite some discussion on how to define jets in ways 
which allow for clear comparison between experiments, and between measurements and state-of-the-art 
theory. The importance of such discussion has been emphasised since both by the adoption of new jet 



finders by ATLAS and CMS for their first data (in particular the anti-A;^ algorithm 1 178 1), and by the 
rapid expansion of activity aimed at exploiting the internal properties of jets in studies and searches for 
new physics at the LHC. 

Jet substructure has been studied at previous experiments as a means of investigating QCD. In fact 



jet shapes and subjet multiplicities have been used to measure the strong coupling as \ 179 180 1. What 
leads to a huge increase in interest at the LHC is the same fact that makes the LHC such an exciting 
project in general - the energy reach is substantially higher than the electroweak symmetry-breaking scale. 
One consequence of this which has been well appreciated for some time is that large multiplicities of 
electroweak-scale objects may be produced (e.g. W,Z, jets with pT ~ O(IOO) GeV). This has been a 
spur to a large amount of theoretical activity on higher order calculations and parton- shower-matched 
Monte Carlo simulations which has been a feature of many workshops, including this one. 

A consequence of the high energy of the LHC which was more slowly appreciated, but which is now 
quite widely understood, is that objects with masses around the electroweak scale may be produced well 
above threshold, and thus be highly boosted. When boosted objects decay to hadrons, the decay products 
will be collimated and may emerge as a single jet with distinctive substructure, even if the decay is initially 
to two or more quarks or gluons. It is easy to understand how such a situation will complicate the mapping 
from jets to partons which is the basis of almost every phenomenological study involving jets. In addition 
to the merging of jets due to high boosts, at the LHC, there will be unprecedented contamination from the 
underlying event and pileup, further deforming the naive jet-to-parton map. The basic goal of much of the 
work in jet substructure has been to improve the mapping from jets to partons, so that it can remain useful 
at the LHC, in light of new classes of signatures (like boosted object decays) and the extra contamination. 

Jet substructure techniques were used first in studies of diboson production for identifying the hadronic 
decays of vector bosons in diboson and SUSY production 1 181 -[183 1, and in the 2007 workshop there 



was much discussion on boosted-top reconstruction, for example [184]. In the last two years, techniques 



have been developed and applied to several channels including top production 1 185 - 187 1 as well as 



Higgs 1 158[|159[|188[[T89] and neutralino 1 190[|191] searches, and several proposals have been made to use 



substructure information to improve jet mass resolution in general 1 192- 194 1 using substructure. Many 
of these techniques have been investigated by the experimental collaborations, using realistic detector 
simulations, and survived the test. 

Another feature of the LHC relevant to jet substructure is that the calorimeters and trackers are superior 
to those at previous machines, allowing access to more detailed information about the jet itself. This 
information should be utilized in a more refined substructure analysis, but is only starting to be exploited. 
As the physical information from particle flow or topo-clusters becomes better understood, the power of jet 
substructure analyses will certainly improve. It is not surprising that a next-generation machine requires 
next-generation analysis tools, and the work on jet-substructure has just begun to probe the possibilities. 
Jet substructure is a rapidly moving field, with at least two dedicated workshops held since the meetings 



in Les Houches 1 195 1 and more planned. Here we give a snapshot of the current status, focused (but not 



exclusively) on some developments which took place in this workshop. 



^'Contributed by: J. M. Butterworth, M. D. Schwartz 
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15.2 GENERAL FEATURES 

In QCD, the evolution from a hard parton to a jet of partons takes place in a regime where the energy scale 
is high enough to use perturbation theory, x is not very small, and coUinear logarithms are large. This is 
essentially the kinematic region where DGLAP evolution should apply. This is largely understood within 
QCD, and can be calculated. It forms the basis of the paiton-shower models implemented in the most 
widely used MC generators. Since the phase space between the jet px and the confinement scale is large, 
there is plenty of radiation, and parton multiplicities will be large. Confinement enters the picture at the 
hadronization (non-perturbative) stage, and is expected to have a small effect (at the sub-GeV level). In 
general, the aspects of QCD relevant for jet substructure seem to be well-modelled by tuned Monte Carlo 
simulations, as is borne out by the available data 1 179[ 196 -200|. 



There are two somewhat distinct goals of the recently developed techniques. One is to improve the 
single jet mass resolution, and the other is to distinguish jets originating in heavy object decays from 
those coming from pure QCD backgrounds. The first of these goals requires a careful definition, in 
line with the discussion of jets in the last workshop 11]. Jet mass must be defined in terms of infra-red 
and coUinear safe variables on the one hand, and in terms of physical observables on the other, to avoid 
making completely model-dependent "measurements". However, the jet mass resolution may be optimised 
using models, in paiticulai" models which implement or mimic jets from the decay of a colour- singlet, 
by switching off initial state radiation; and models which mimic jets from parton (rather than proton) 
collisions by switching off underlying event. The second goal, background suppression for heavy particle 
decays, generally relies on the fact that in QCD the various splittings within the jet are expected to be 
strongly ordered in scale, with large asymmetric splittings, whereas partons resulting from a heavy particle 
decay will in general share the energy equally in the particle rest frame, leading to more symmetric 
configurations. 



15.3 FILTERING 

Let us begin with a discussion of what shall refer to generically as, filtering. Filtering refers to the removal 
of components of a jet in an attempt to clean up the jets in a certain way. Variations on this theme have 
appeared, such as pruning [192] and trimming [ 194]. The basic idea of filtering is that one first finds a jet 
which is assumed to contain the radiation one is interested in as well as contaminating radiation, then one 
removes the contamination with the filtering step. The difference in the algorithms is more a difference 
in how the authors are imagining their routines used than in the way the routines themselves work. So 
we will organize the discussion of filtering by application. Some approaches are application-specific, 
optimized to find a particular signal, such as boosted Higgs or boosted top decays, while other applications 
are more general. We will be begin with the specific ones. 



15.31 FILTERING FOR SPECIFIC APPLICATIONS 

The most productive application of filtering so far has been in the search for heavy boosted objects which 
decay to standard model jets. Consider first a Higgs boson decaying to bh in associated production with 
a W. This channel is the discovery channel for a light (~ 120 GeV) Higgs at the Tevatron, but is very 
difficult at the LHC due to much more complicated backgrounds, in particular ti production. Going to 
a regime where the Higgs is boosted {pT > 200 GeV) has the effect of essentially eliminating the tt 
background and removing a lot of the T1^-i-jets background as well. In this case, the 6-jets from the Higgs 
decay may be within Ai? = 1.0 of each other, so finding separate R = 0.7 jets may fail. The substructure 
approach pioneered in |[158 | was to find the jets together as a fat jet, with a cone size of i? = 1.0 or greater, 
then to parse the jets to find the 6-quarks as subjets. 

To be specific, in flSS], the authors use the Cambridge/Aachen algorithm [201 ''202'] to construct the fat 
jet. This algorithm merges constituents based on the distance Ai? = -\/(Ary)2^F(A(^ between them. 
Then they undo the clustering steps using the clustering history (conveniently stored by FastJet [203 1 
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within the jet object). Somewhere in the declustering the b should be separated. However, the way 
Cambridge/Aachen works does not guarantee that the the final merger is of the two 6-jets into the fat 
jet. Therefore, when tracing back through the declustering, one needs a different criterion, for example, 
looking for large mass drop - the masses of the subjets should be much smaller than the mass of the fat jet. 
Once a large mass drop is found, the process is stopped. All constituents of the jet which would have been 
split at this stage are retained, and the clustering algorithm is rerun, using the angle between the centres of 
the last two subjets to set a (smaller) angular scale. This effectively optimizes on an event-by-event basis 
the selection of final state radiation from the (color singlet) Higgs candidate, while rejecting radiation 
from elsewhere, including the underlying event. This is the step described as "filtering" in the analysis. 
The top two or three subjets are combined to reconstruct the Higgs mass. The possible inclusion of a 
third jet, rather than just two (representing the b and b) allows for an extra hard gluon emission. This final 
step is critical in getting the algorithm to perform well at the Higgs mass reconstruction. The result was 
confirmed by an ATLAS study y_59j|, and revived H ^ bb as a. discovery channel for the Higgs at the 
LHC. 



Another application of filtering has been in hadronic boosted top decays 1 185 1. Here, the same basic 
principle is applied: a fat jet is found, subjets are located through declustering, and a filter is applied to 
remove unwanted radiation. In this case, the three or four hardest subjets are filtered out. Three of the jets 
should represent the hadronic top decay products: the 6-jet and the two quark jets from the W decay, the 
fourth may be an additional hard gluon. Using this approach, the authors found 99% background rejection 
efficiency at 40% signal efficiency, per jet. Therefore signal/background for boosted hadronic tt events 
over dijets is enhanced by a factor of 1000. A CMS study confirmed these efficiencies with full detector 
simulation | |204] . 

Since the boosted Higgs and boosted top taggers described above rely on reversing the clustering 



sequence of a jet, there is naturally a dependence on the jet algorithm. We show in Fig. 47 a comparison of 
different algorithms for a variable used by the CMS top-tagging study |204| . Of the 3 or 4 subjets filtered 
out of the top jet, they looked at the minimum invariant mass of any pair. In the first panel, clustering with 
Cambridge/Aachen shows the peak at ~ 80 GeV corresponding to the W-mass in the signal, which is 
absent in the background. The second peak around ~ 20 GeV corresponds to wrongly identified subjets 
and mimics the QCD background. The result for the k±_ algorithm is similar. The anti-A;^ algorithm, 
however, does not show the H^-mass peak. This is because anti-fci^ essentially works backwards to the 
other clustering algorithms, with the hardest particles being clustered first, and the soft stuff clustered 
at the end. One can see that anti-fc^ is not ideal for filtering and substructure analysis. However, the 
final panel shows that if the fat jet is found using anti-fc^, but the particles in the jet are reclustered using 
Cambridge/Aachen, the substructure is still there. We conclude that the substructure analysis and the 
fat-jet finder can be chosen independently. 

15.32 GENERAL FILTERING TECHNIQUES 

Having seen filtering successfully applied in a number of specific cases, it is natural to ask whether the 
approach can be generalized to cases when you do not know a priori what it is you are looking for. For 
example, in the Higgs—;- bb study, the fat jet was filtered to look for 2 or 3 hard subjets, in top-tagging, 
the fat jet was filtered to look for 3 or 4 hard subjets. The idea behind jet pruning is that is is possible 
to filter the jet without knowing ahead of time how many hard constituents to expect. Instead, one can 
decompose the jet until some measure of the size of each splitting is saturated. For example, in 1 192[ , 



the pruning algorithm based on Cambridge/Aachen declustering stops when a splitting has Aij > Dcut, 
where Dcut oc 2mj /px is determined on a jet by jet basis. The authors demonstrated that this general 
method successfully finds the subjets in the specific cases of boosted top or boosted jets. 

Another general approach is jet trimming \ I94| . Here the goal is not to find hard subjet constituents in 



fat jets, but to clean up simple quark or gluon-initiated QCD jets. Suppose we have a massive color singlet 
(j) decaying to qq. Then the mass of the resonance should be reconstructable using all of the final-state 
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Fig. 47: Comparison of different clustering algorithms for the minimum invariant mass of two subjets. 
The Cambridge/Aachen and k± algorithms both work well and finding the subjets corresponding to the 
W decay, while anti-fcj^ fails. The bottom right panel shows that anti-A;^ can still be used to find the fat 
jets, if the jet constituents are then reclustered with Cambridge/Aachen to find the subjets. 



radiation from the quarks. However, in reality, it is impossible to separate the final state radiation from 
additional radiation originating either from the underlying event or from other hard objects that happen to 
also be present. By studying the radiation pattern from FSR only to FSR plus contamination, the authors 
observed that the contamination could be reduced with trimming. 

The trimming procedure they proposed works as follows. First jets are found as usual, say with 
anti-fcx with R = 0.8. Then the jet constituents are reclustered into many tiny jets using a much smaller 
clustering size, say R = 0.2. These tiny jets are then discai^ded if their p^- is below a certain threshold, 

shows an 
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say p^r^^^ < O.Olp^p', or only a certain number of them, say the five hardest, are kept. Figure ■ 
example of how trimming removes contamination from underlying event. Note that in contrast to pruning, 
trimming does not reverse the clustering sequence to find subjets, rather it reclusters with a different scale, 
in a manner similar to the filtering stage in the Higgs analysis described above, but not solely focused on 
that final state. Thus, good results can be obtained using anti-A;^ both to find the original jets, and to find 
the tiny jets during the reclustering step. 

15.4 SUMMARY AND FUTURE WORK 

Let us briefly mention some other related ideas which have been proposed. In |I205|, the authors proposed 
that greater efficiencies could be obtained by choosing a jet radius continuously which decreases with 
increasing pT- Similar scalings had been used previously, for example in the boosted top-tagging 
study |T85 1, but with a discrete size change rather than a continuous one. The insight of ]205[ was to make 



such a scaling automatically part of the jet algorithm. While changing the jet size does seem to produce 
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Fig. 48: Effectiveness of jet trimming at removing contamination from the underlying event. The jet mass 
after trimming appears similar to the jet mass with underlying event turned off. (Figure courtesy of D. 
Krohn.) 



improvements, the benefits may not outweigh the complications in real experimental settings, such as the 
increased difficulty of jet energy scale calibrations. For example, CMS chose in their implementation 
of top-tagging | 204| to keep the jet size fixed, and were able to achieve comparable efficiencies to the 
theoretical study with the R variation. 

Another idea is to use QCD jet shapes such as planar flow or angularities to distinguish heavy particle 
decays from QCD jets [206,207J . A comparison of the efficiency of this approach in top tagging, and a 
more complete summary of other key issues in jet substructure can be found in |208[ . 

In addition there is clearly potential in studying QCD characteristics beyond the jets. Recently, a 
proposal was made of for a simple variable pull which can characterize the color flow of an event |209|. 
Color flow is something that is not contained in the 4-momentum of a jet and so may provide a nice handle 
on new physics which is complementary and uncorrelated with traditional techniques. For example, it 
may help distinguish color singlet objects (such a the Higgs) from the other color structures in the QCD 
background. Although color flow is sensitive to aspects of QCD that go beyond the leading-log, DGLAP 
evolution of basic Monte Carlos, recent progress in the development of Monte Carlos assure us that such 
effects must be both visible and calculable (cf. Section [TO^ of these proceedings). The prospects of 
extracting such superstructure information at the LHC are just beginning to be explored, and there is 
certainly much more to be done and understood. 

In subsequent sections of these proceedings more details of physics relevant to jet substructure are 
discussed: the status of simulations for jet substructure in heavy particle decays (Section 16. 1, detector 
issues (Section [l7.| ), radiation between jets (Section 18. 1 and calculations of boosted bb production in 



Wbb events (Section|12.|), which is a significant background to Higgs searches using jet substructure. 
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16. HEAVY PARTICLE DECAYS IN MONTE CARLO EVENT GENERATORS E3 

In the coming years the LHC will probe a new energy regime one order of magnitude higher than that 
which has been accessed to date. It is therefore natural that heavy, unstable particles, such as the top-quark, 
will be produced in large quantities. Indeed, next-to-leading order QCD predictions for top quaik pair 
production indicate that it pairs will be created at a rate of around 0.3 Hz at low luminosity (10 fb^^ per 
year). Of course, it is widely anticipated that physics beyond the Standard Model (SM) will be discovered 
at the LHC, hence many other species of heavy particles are also expected to be produced. 

It is well known that modem collider physics analyses rely upon Monte Carlo simulations for a wide 
variety of applications, from the modelling of detector effects through to the determination of fundamental 
parameters; the top-quark mass determination being a case where the dependence on simulations is 
particularly acute. Since new, unstable, heavy particles feature in almost all extensions of the Standard 
Model, it is foreseeable that many more analyses of this type will be carried out in the LHC era. Clearly, 
as the role of the event generators in the analysis becomes increasingly significant, so too does the need 
for them to model the underlying process precisely and, when this is not possible, it is equally important 
that their limitations and inaccuracies are understood. 

In the following we shall critically review some of the main issues to be addressed in the simulation of 
heavy particle decays. The material is intended to better inform users of event generators regarding the 
nature of the methods and approximations employed, rather than to provide a comprehensive technical 
review of the state-of-the-art. The first part of the discussion concerns the generalities of the physics 
phenomena and their simulation, rather than a description of a specific event generator. In the second 
part we give brief, itemized, technical descriptions of the three main general-purpose event generators, 
Sherpa ||23|, Pythia 8 |T8l20| and Herwig++ |21 1, with reference to the background material. 



16.1 SIMULATION STRATEGY 

In general, for each event, all of the multi-purpose event generators begin by generating an initial 'hard' 
configuration of particles according to the associated tree-level matrix elements, after which paiton 
showers are initiated from the external legs. When the showering is complete the final- state consists of 
a set of partons with constituent masses which are then hadronized. On the other hand, for processes 
involving the production and decay of unstable particles, including decay chains, rather than attempting to 
generate the ultimate final-state particles directly, according to high multiplicity matrix elements, each 
program first generates the momenta of the initial unstable heavy particles in the production phase, before 
going on to (quasi- )independently generate the momenta of their decay products, showering the colour 
charged objects in each phase. As well as being more manageable, this approach benefits from being 
substantially more efficient, computationally, and also from being highly versatile, in keeping with the 
multi-purpose paradigm that the event generators are built on; in particular, the independent generation 
of production and decay momenta lends itself naturally, through iteration, to the simulation of arbitrary 
decay chains. Of course this generality comes at the cost of varying degrees of approximation, which we 
will now discuss briefly below. 



16.2 THE BASE APPROXIMATION 

In this subsection we elaborate on the series of approximations by which the production and decay of a 
particle may be treated entirely independently modulo charge and momentum conservation. In doing so 
we will arrive at the base approximation: the common starting point from which the simulations may or 
may not start to model omitted physics effects and undo approximations by various means, some of which 
will be addressed in Section [T6. 31 

^'^Contributed by: K. Hamilton, L. Lonnblad, S. Schumann, T. Sjostrand, J. Winter 
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16.21 RESONANT AND NON-RESONANT CONTRIBUTIONS 

Experimental signals are defined in terms of the particles which actually enter the detector, leptons, 
photons and jets, rather than the heavy unstable objects which we typically are interested in studying, 
upstream. The corresponding theoretical predictions are then strictly required to be defined in precisely 
the same way, including contributions from all Feynman diagrams which give rise to the same external 
states, see e.g. | ,210j . Generally, however, only a limited subset of these diagrams involve the exchange of 
s-channel resonances which go on to decay into the stable objects comprising the signal, i.e. diagrams 
representative of heavy particle production and decay (chains). By their nature the resonant graphs give 
the dominant contributions to the cross section for a given signal and calculations based on this subset 
of Feynman diagrams are known to provide excellent approximations to the full result. Moreover, in 
the case where the heavy s-channel particle is that which we wish to study, experimental analyses will 
regularly employ cuts which further enhance the resonant contributions. In keeping with these observations 
the simulation of non-resonant effects in all but the simplest processes, namely 2 — > 2 processes, is 
typically beyond the scope of the multi-purpose Monte Carlo event generators, furthermore, the loss of 
accuracy incurred through the omission of the non-resonant contributions is limited to the extent that this 



approximation is often also employed by tree-level event generators such as Madgraph / MadEvent |81 1 



or Whizard | 211| , even though they have the facility to include them automatically, in order to increase 



computational efficiency. 

16.22 FACTORIZATION OF PRODUCTION AND DECAY 

Although there exists, in general, quantum mechanical interference between the production and decay 
of an unstable particle, in the limit that the resonant particle is exactly on-shell, the matrix elements 
corresponding to the resonant graphs may be factorized into two parts, one corresponding to the production 
of the heavy particle and one corresponding to its decay. Take, for example, a process pp — )• bW^ + X, 
which naturally receives resonant contributions in the form of graphs containing an s-channel top-quark; in 
the limit that the top-quark momentum is on-shell, one can replace the propagator numerator i{pt + rtit) 
with a sum over spins: 

lim M \ A^AlrA'^A^r (28) 

where A^ = A^u\ (pt) and A^ = u\ (pt) A^ are the amplitudes for the on-shell production and decay 
processes, pp ^ t + X and t — bW^ respectively and where the quark helicity indices. A, A' are summed 
over. In the so-called narrow width approximation where the product of the unstable particle's width and 
mass tend to zero we have 

lim \ > ^Hp^ - m^) , (29) 

i.e. in the narrow width approximation the replacement of Eq. ( [28| ) is valid. A completely analogous 
replacement to that in Eq. ( [28] ) holds for vector bosons, in that case, as with the fermion propagator 
numerator, the vector boson propagator numerator can be replaced by a sum over polarizations. 

A further approximation commonly (but not universally) used in event generators is the neglect of 



spin correlations. This amounts to assuming that in the factorized amplitudes in Eq. (28 1, the graphs 
corresponding to different helicity states of the intermediate particle do not interfere with one another 
and that they are produced in equal measure i.e. the full matrix element is effectively replaced by the 
product of two spin summed matrix elements, one corresponding to the production Ai^, the other to the 
decay .In our pedagogical example, working in the narrow width approximation and neglecting spin 
correlations, the exact squared matrix is replaced according to 

lim M 6 (pf - mf) M^M^ . 
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In addition, by inserting dq^ 6 (gf — p'f) and d^pt 6 {pt — Pb — Pw+) into the usual Lorentz invariant 
phase space measure we may, without approximation, factorize the Lorentz invariant phase space, d<I>, for 
bW^ + X into parts corresponding to the same production and decay processes: 



where. 



d$ = d^pd^DT^dq^ (30) 
zvr 



d^p = d^td<^xi2TT)U''ipa+Pb-pt-px), (31) 

d<pD = d^bd^w+ {2Tr)^ 5^ {pt - pb - Pw+ ) , 
and for a given particle i, d^i is the usual invariant phase space measure 

d^^ = ^'f . (32) 

Using this phase space factorization together with the narrow width approximation one can see that the 
differential cross section will factorize into two parts; denoting flux and PDF factors by C (x^, xq) we 
have, 

do" = dx^^dxQ C {x(Q, Xq) d^ M. (33) 

= dxQdxQC{xQ,XQ) d^pM^ X ^dTt 

i t 

dVt = -^X^d^D, 
2mt 

where the first part corresponds to the spin summed production of the unstable particle and the second 
part to its branching ratio, differential in the decay phase space. By appealing to the narrow width 
approximation it is therefore possible to first generate the production and decay processes completely 
independently in their rest frames, then boost the decay products to the frame in which the particle 
was produced. The latter boost is ambiguous up to a rotation which is a direct manifestation of having 
neglected spin correlations. 

The simulation procedure and the approximations which we have described up to this point comprise the 
most basic algorithm employed by Monte Carlo simulations for the treatment of heavy particle production 
and decay; of course, for the parton shower simulations, these tree level configurations will, by default, 
go on to include the effects of soft and collinear parton emissions from the coloured particles. Unless 
otherwise stated this represents the base accuracy and physics describing the interplay of the production 
and decay of unstable particles in these simulations: non-resonant contributions, spin correlations and 
off- shell / finite width effects are all completely neglected. 

16.3 BEYOND THE BASE APPROXIMATION 

Having noted the basic nature of the base approximation we will now briefly review some of the ways in 
which it can be improved on. Note that these enhancements can be extremely sophisticated and so in no 
case should they be assumed to have been implemented as a default option, for the same reason, it should 
be clear from the simulations' documentation whether such features are available or not. 



16.31 FINITE WIDTH EFFECTS 

Although the factorization of the numerator of the unstable particle propagator only holds when it is 
exactly on-shell, the error induced by using the replacement Eq. (28 1 when this is not true is O {F/m) 
i.e. the typical amount by which the particle is off-shell with respect to its mass. Of course, if cuts or 
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observables are such that the channel in question only contributes when the particle is far off-shell, e.g. if 
cuts correspond to a mass-window restricting the invariant mass to be far from the pole mass, the error 
is much greater and, in any case, in such circumstances the inclusion of non-resonant contributions is 
mandatory. Provided the signal regions receive contributions from the immediate vicinity of the resonance 
we may recover its line-shape up to corrections O (F/m), by inserting 1 into the factorized cross section 
formulae as, 

J {^p^ — TTi^j -)- TTi^i ^ 

This is equivalent, of course, to not taking the exact limit F — )• in the Breit-Wigner function in Eqs. (28 1 
and ( p9] ), we stress that this limit is nevertheless implied by assuming that the propagator numerator may 
be replaced by the sum over external wavef unctions. 

In terms of the event generation procedure, the inclusion of finite width effects in this way, leads to a 
trivial modification of that described above for the F = case, namely, that the first step in the algorithm 
is now the generation of an off-shell mass for the intermediate particles according to a Breit-Wigner 
function. The same procedure can be adopted using running widths. 

Although this seems, in principle, a relatively straightforward means of improving the base approxima- 
tion (F = 0), a thorough treatment of the particle width, in particular those involving next-to-leading order 
computations, requires that further, more subtle, issues be addressed before it is introduced anywhere at 
all. Of note is the question of how to ensure that the introduction of the finite width does not violate gauge 
invariance? The particle width term arises from the resummation of the imaginary parts of the self-energy 
insertions on the unstable particle's propagator i.e. it corresponds to the inclusion of a subset of higher 
order corrections to the process under consideratiorp^ Since S-Matrix elements are only guaranteed to 
be gauge invariant at each order in perturbation theory when all relevant diagrams at the given order 
are summed, one should expect that naively adding a width term to the propagator will violate gauge 
invariance. Although this is the case in general, usually the numerical impact of the gauge breaking terms 
is neghgible and is contained within that incurred by writing the propagator numerator as a polarization 
sum. 

On the other hand, gauge invariance underlies the cancellation of large logarithmic corrections to 
inclusive quantities as laid out in the Bloch-Nordsieck |212| and KLN |213 214 1 theorems, therefore, 
in cases where logarithmically enhanced emissions occur, the gauge variant contributions can be hugely 
magnified and even exceed the correct, gauge invariant, contributions. These potentially dangerous 
cases are confined to processes involving diagrams in which the unstable particle we are studying emits 
soft / coUinear massless partons or photons e.g. gluon emission from top quarks or photon emission 
from W bosons. For simulations involving tree level matrix elements and / or parton showers this issue 
is substantially negated due to the obligatory generator level cuts, which exclude the phase space for 
coUinear and soft regions and, also, because the splitting functions at the heart of the parton shower 
dynamics are derived from the universal factorization of matrix elements with soft / coUinear emissions. 
Furthermore, it is clear that for the Lagrangian to be gauge invariant it is not necessary for the parameters, 
in particular the masses, to be real numbers. Bearing this in mind, one may adopt the so-called complex 
mass scheme \2\5\ whereby finite width effects are included in a gauge invariant fashion by simply 
reinterpreting the masses, everywhere that they occur in the Lagrangian, as being complex numbers with 
the iniT width factor absorbed inside them. Alternatively the so-called 'fudge-factor' scheme | 216| is 
employed whereby the total squared matrix element, neglecting finite width effects, is multiplied by the 
product of the Breit-Wigner function and the zero-width propagator, — m?, squared for each resonant 
particle in the process. For further technical details concerning gauge invariance and finite width effects 
see Refs. | |2T5H2T9l . 

Another pertinent issue is that of how to include threshold effects in the running width and the selection 
of decay modes. In the forthcoming years LHC analyses will predominantly be concerned with searches 

^'it therefore also has a dependency on the particle's mass, which we shall omit here for simplicity. 
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for new particles, whose masses couplings and decay modes are all unknowns. A prime example is the 
Standard Model Higgs boson, whose width and decay modes strongly depend on, not simply its on-shell 
mass but, in view of the factorization of the production and decay, also the off-shell mass generated at 



the beginning of the simulation, Eq. (29 1. This is of particular importance for particles decaying close to 
threshold (as the Higgs boson may do), in this case the effects from the off-shell propagator must be taken 
into account. This can be achieved by implementing the running width, i.e. including the full dependence 



on the particle's off -shell mass, in the weight factor in Eq. ( 34 1, as well as, in the calculation of the partial 
width of a selected decay mode as in Ref . |220J , where this prescription is shown to reproduce well the 
results of some three-body decays simulated as a cascade of two two-body decays. 

16.32 SPIN CORRELATIONS 

As stated previously in the base approximation particle decays are performed according to the spin 
averaged matrix element for the decay process hence they occur isotropically in the rest frame. It follows 
that in order to have a reasonable description of observables sensitive to the details of individual decay 
products it will be necessary, at least, to communicate the spin information from the production to the 
decay parts of the simulations. Notable studies for which such correlations are particularly relevant 
include: the production and decay of the top quark, the production of tau leptons in Higgs decays and 
the spin determination of newly discovered particles i.e. understanding the nature of physics beyond the 
Standard Model. 



Generalizing earlier work of Knowles and Collins | |221 - j223| , a flexible, efficient algorithm for propa 



gating spin correlations between particle production and decay in Monte Carlo event generators, using the 
spin density matrix formalism, was developed by Richardson f224] . Rather than discuss the algorithm 
in full detail here we will describe it by considering the example of the process hh ^ ti where the top 
quark subsequently decays, via a W boson, to a 6 quark and a pair of light fermions. Initially, the outgoing 
momenta of the t i pair are generated according to the partonic differential cross section 

d?^ = ^^£A,<A,d$P (35) 

where -^AtAj amplitude for the initial hard process, with \t and denoting the helicity indices 

of the produced t and t respectively. d$p is the usual phase space measure for the production process. 



equal to that in Eq. (31 1, where in this example X = t. Having generated the momenta for the production 
process, one of the outgoing particles is then picked at random, say the top and a production spin density 
matrix is calculated 

with N defined such that has unit trace. 

The top is decayed and the momenta of the decay products distributed according to 

dVt = — p\ V '^\\ -^fA* d$D , (37) 
where the inclusion of the spin density matrix ensures the correct correlation between the top decay 



products and the beam. We note how the spin averaged result, Eq. (33l, is recovered by replacing 
p\^x- ~^ 5 "^^i^- With the momenta for the top quark decay products now in hand we may calculate a 
production spin density matrix for the unstable 

pV A' = Tf P\ A' -^f 'a -^f'T > (38) 



and then generate its decay by analogy to Eq. ( 37 1. Here the use of the production matrix calculated from 
Eqs. ( [36] ) and (38 1 leads to the correct angular correlations between the light fermions, the beam and the 
bottom quark. 
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Since the children of the are light fermions the decay chain terminates and a decay matrix for the 
is calculated: 

Moving back up the decay chain, the analogous decay matrix is calculated for the top quark using the 
decay matrix of the : 

, = l^f' A^,'* Df^ y . (40) 

AtA, AT- AtA,„+ A;A' , Aj^+A„ , ^ ^ 



Following this one contracts Dt^^y with the full tt production spin density matrix, -^x^x- -^v*A'-' ^° 



the production spin matrix of the antitop quark, p\_y , analogous to Eq. (36 1, before generating its decay 

products in the same way as was done for the top quark. In progressing forward along the t decay chain 
the production spin density matrices pass information from one decay to the next leading to the correct 
angular correlations. 

Other methods for the inclusion of spin correlations in event generators are in use, although many 
of these may be regarded as approximations or variants of the above [81J . The only other approach 
which may be regarded as distinct, involves decaying all unstable particles in the event in a single step, 
using matrix elements for the process under study where these particles have and have not decayed. This 
algorithm has been applied in Pythia, for quite some time, for a number of processes and has been recently 
been extended in the context of matched next- to-leading order simulations by Frixione et al[\ 14]. This 
alternative method is based on the observation that the product of the separately spin summed matrix 
elements for the production and decay processes, including propagator factors, always exceeds the full spin 
summed matrix element. Knowing this, spin correlations may be implemented by generating production 
and decay momenta as in the base approximation (with the decays isotropic in their rest frame) choosing 
a random number TZ and then regenerating the decay kinematics if IZ is greater than the ratio of the full 
spin summed matrix element to the spin summed matrix element omitting spin correlations. For a given 
set of production momenta, isotropic decay kinematics and new values of TZ are generated repeatedly 
until TZ is less than the aforesaid ratio, at which point the decay kinematics can be kept. 

Since the underlying Monte Carlo algorithm in this alternative approach is just the hit-or-miss method, 
with no spin density matrices, spinor phases etc, to keep track of, this method is substantially easier to 
implement than the one above. On the other hand this method assumes that a matrix element generator 
capable of providing helicity amplitudes for (arbitrarily) high multiplicity final states is available, moreover, 
the intention here is to simultaneously generate the momenta for all decaying particles, in 'one shot', 
which also impedes the event generation efficiency with respect to the recursive approach based on spin 
density matrices. Nevertheless, for including spin correlations in events with relatively few decays, this 
method offers a simple and straightforward alternative to the spin density matrix approach, where the 
relative inefficiency tends to add little, with respect to other elements of the simulation, to the overall 
event generation time. This method has been used widely in simulations based on the MCSNLO 1 12 114) 
and POWHEG |[T4l|118|[T24l methods. 



16.33 QCD RADIATION 

Having generated the decay momenta, with or without finite width effects and spin correlations, if the 
mother particle or any of the daughter particles carry a colour charge, one should attempt to model the 
effects of QCD radiation which they may emit. The vast majority of these emissions will be soft and / or 
collinear emissions with respect to the shower progenitors. This is straightforward to understand from the 
point of view of the associated amplitudes: in the limit that a massless particle is emitted with low pT, the 
propagator denominator, associated to the internal line that was its mother, vanishes so the matrix element 
diverges for such configurations. 

Such low pt corrections are precisely those which parton shower simulations take into account, to all 
orders in perturbation theory. Of course, distributing emissions according to arbitrarily high multiplicity 
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matrix elements is not feasible, instead we appeal to the fact that, in the limit that a massless parton j is 
emitted coUinear from an external leg i, the matrix elements factorize according to 



M 



n+l 



Svras 



2 



(41) 



where ij denotes the mother of the branching ij — ij, with virtuality q~., and Pf-^^ is an Altarelli-Parisi 
splitting function. The form of the splitting functions does not depend on A4n+i and Ain, where the 
latter is the matrix element for the n particle process, in which the branching does not occur. The matrix 



element factorization in Eq. (41 1 may be combined with a phase space factorization for the branching, 
leading to a factorized form of the differential cross section for the n+l particle process, in a similar 
vein to that described in Section 16.22[ in the context of heavy particle decays. Here, as with cascade 
decays, the factorization may be applied recursively, albeit with the caveat that using Eq. ( [4T] ) implies 
the emissions be ordered such that they become increasingly coUinear as one works from the core of the 
process out toward the external legs. As well as modelling the effects of enhanced coUinear real emission, 
the parton shower method also, necessarily, incorporates their corresponding virtual corrections through 
Sudakov form factors, the effect of which is to dampen, and ultimately regulate, the diverging emission 
rate as pT — )• 0. 

Depending on the accuracy of the parton shower, there is some freedom in the definition of 'increasingly 
coUinear' . Parton shower algorithms may be formulated as an evolution in the virtualities of the branching 
partons, or as an evolution in the transverse momentum of the branching products. However, formal 



studies of colour coherence 1 101 140- 148j reveal that branchings involving soft gluons should be ordered 
in the angle between the branching products. This is a dynamical effect whereby wide-angle soft gluon 
emissions, from near coUinear configurations of two or more partons, have insufficient transverse resolving 
power to be sensitive to the constituent emitters. In effect the resulting radiation pattern is determined by 
the colour charge and momentum of the mother of the emitters, rather than the emitters themselves p°| 

In the LHC era we will regularly face the situation in which one cannot neglect the mass of radiating 
particles when modelling the distribution of their emissions, ti bar production being an obvious example. 
Nevertheless, the Altarelli-Parisi splitting functions governing the distribution of emissions in the older 
generation of shower simulations are based on the factorization of matrix elements where the mother 
parton, ij, and its two daughter partons i and j are all massless. As stated above, in the limit that massless 
partons emit low px radiation the associated amplitudes diverge, however, if the emitter has a mass, 
the propagator denominator is proportional to 1 — /3 cos 6 , where the factor of /3, the velocity of the 
emitter, screens the coUinear divergence. This mass effect in the propagator denominator then greatly 
alters the distribution of emissions as the small angle region is approached with respect to the rather 
crude massless approximation, which will likely have important consequences for analyses e.g. the top 
mass determination, where a good understanding of production radiation and b quark fragmentation are 
key. However, in recent years, developments in the resummation of radiation from massive partons have 
included the introduction of quasi-collinear splitting functions which generalize those of Altarelli and 



Parisi, so as to retain the full dependence on the mass of the emitting particle in the coUinear limit |227 1. 
For example, the quasi-collinear q ^ qg splitting function is 
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+ (1 - z)^ m2 



(42) 



in which z is the light-cone momentum fraction carried by the daughter quark and p^ is the transverse 
momentum of the branching; note how the form of the first term in Eq. ( [42| ) is the same as the usual 
massless q ^ qg Altarelli-Parisi function. The inclusion of these mass effects in parton shower simulations 



' To avoid digressing from our stated objectives we limit ourselves to this minimal description of the parton shower 
approximation and refer the interested reader the following dedicated review articles fT4|[T8] [225|[226) . 
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is actually non-trivial, since the full showering formalism is required to retain mass effects from the 
beginning i.e. in the shower variables, the Sudakov form factors, the cut off scales, the phase space 
mapping and kinematics reconstruction / momentum reshuffling. 

Since, in the narrow width approximation, the production and decay processes are assumed to factorize 
into two independent parts, this is the sense in which we simulate the associated parton showers, effectively 
treating the decay as a completely independent hai^d sub-process. For the decay process the effects of 
QCD radiation are then simply modeled by initiating parton showers from the incoming and external 
particles, in the rest frame of the decaying particle. Naturally then, for the decay products, the same 
final-state showering routines are applied as are used for the hard scattering process, including those used 
in the initialization phase in which starting scales are assigned. Only the initial conditions for the shower 
evolution are different, although their choice is, nevertheless, still based on examining the colour flow in 
the underlying hard decay process. 

The leading order decay products may not, however, be the only particles to emit radiation. Massive 
coloured particles, such as the top quark and particles in many models of physics beyond the Standard 
Model, may decay on time-scales shorter than that characteristic of the hadronization process. Conse- 
quently, subject to the available phase space, as well as undergoing time-like showers (g^ > m^) in 
their production phase, these particles will also undergo a further space-like showering (g^ < m^) of 
QCD radiation prior to their decay. In contrast to the final-state showers from the decay products, the 
initial-state space-like shower created by a decaying particle is quite different to that of an initial-state 
particle from the production process. In particular, it involves no PDFs, since the heavy parton originates 
from a hard scattering as opposed to a hadron. Furthermore, in the hard process it was necessary to evolve 
the initial-state partons backwards from the hard scattering to the incident hadrons, to efficiently sample 
any resonant structure in the underlying matrix elements. On the contrary, in decay processes, degrading 
the invariant mass of the decaying particle, via the emission of radiation, does not affect the efficiency 
with which any resonant structures in the decay matrix element are sampled. Hence, it is natural for the 
evolution of space-like decay showers to start with the unstable particle from the production process, 
and evolve it forward, towards the decay. In the older generation of multi-purpose event generators the 
inclusion of this initial-state radiation in decays was not in general possible due to, for instance, their use 
of non-covariant shower formalism. 



16.34 QED RADIATION 

In existing Monte Carlo simulations the production of QED radiation in particle decays is normally 
simulated using an interface to the PHOTOS program |228 - 230J . This program is based on the collinear 
approximation for the radiation of photons together with corrections to reproduce the correct soft limit 
1 228 229 1 . In recent years the program has been improved to include the full next-to-leading order QED 
corrections for certain decay processes |230 - |232 1 . 

As noted previously, in the event that the decay products are heavy, the divergences associated to 
multiple collinear emission are subject to a screening which increases with the mass of the emitter. On the 
other hand, since the propagator denominator associated to an emitter is also proportional to the energy of 
the emitted particle, soft emissions are always enhanced regardless of the emitting particle's mass or the 
emission angle. This being the case it is often preferable, depending on the application / particle masses / 
analysis cuts, to model the effects of photon radiation using a formalism which resums to all orders the 
effects of soft rather than collinear emissions. 

In QED soft infrared divergences are resummed to all orders using the methods originally developed 
by Yennie Frautschi and Suura [233 1 (YES), in doing so the soft real and virtual corrections are seen to 
exponentiate|^ This resummation has been recast as photon shower simulations through a tour deforce 
in Monte Carlo techniques pioneered by S. Jadach | 236f 241 1. Such simulations have no overlap with 



'Pedagogical examples of the exponentiation of infrared QED corrections may be found in Refs. j234|235j 
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their QCD counterparts; whereas in QCD the shower forms through iterating emissions in a Markovian 
algorithm, the YFS QED shower algorithms are Poissonian, generating all photon radiation at the same 
time in 'one-shot', moreover they have always afforded the possibility to systematically include arbitrarily 
higher-order corrections. 

Finally, we remark that soft resummation in QCD is fundamentally different since, unlike QED, each 
emitted parton is itself charged. Unfortunately such additional complexities arising from the non-Abelian 
character of QCD mean that applying the same Monte Carlo approach to simulating QCD radiation, 
despite its many attractive features, is unfortunately not appropriate. 



16.35 HARD QCD RADIATION 

Note that the standard QCD parton shower approach has two important drawbacks. Firstly, because the 
parton shower generates emissions from each leg of the hard scattering independently, each additional 
emission must be uniquely associated to a particular leg of the hard scattering, which can only be achieved 
at the price of having regions of phase space, corresponding to high pT gluon emissions, which are 
unpopulated by the shower, so-called dead-zones. Secondly, the soft / collinear approximation to the QCD 
matrix elements is plainly not a good approximation all over the phase-space region populated by the 
parton shower. 



One way to rectify these problems is through matrix element corrections \ 24 1 which ensure that the 
hardest additional radiated parton in the event is distributed exactly according to the corresponding 
real emission matrix element. In the case of soft matrix element corrections to decays, every emission 
generated in the shower which is the hardest so far is vetoed if the ratio of the exact differential width 
to the parton shower's approximation to it (based on the soft / collinear approximation to the former) is 
greater than a random number IZ. 

Where the parton shower algorithm is such that there is a dead zone in the phase space of the first 
emission, typically concentrated in the high px region, hard matrix element corrections are also necessary. 
Hard matrix element corrections simply use the same exact real emission matrix element to generate an 
event with a hard emission in the dead zone, with a probability given by the ratio of the integrated cross 
section in the dead zone divided by the total cross section of the leading order process. As a consequence 
of the different ordering variables used in the Herwig / Herwig++ and Pythia shower algorithms, the 
entire phase space for the first emission is covered in the latter, avoiding the need for hard matrix element 
corrections. 

Alternatively one may use matrix element-parton shower matching techniques (ME-PS), such as the 
CKKW method, [4, 126] to correct the approximate soft / collinear radiation pattern of the parton shower 
in the regions of phase space corresponding to hard emissions, where in this case the hardness is typically 
measured in terms of the Durham jet measure. In general these ME-PS separate phase space into two 
regions according to a merging scale, defined in terms of the jet measure, ymerge- The region of phase 
space corresponding to values of y < ymcrgo is deemed to correspond to sufficiently soft / collinear 
emissions that one can expect the shower approximation to reliably distribute emissions there, this is the 
so-called shower region. Above the shower region is the matrix element region, where emissions are 
distributed according to exact fixed order matrix elements. Of course the real emission fixed order matrix 
elements do not include any of the virtual effects resummed in the parton shower and so the emissions in 
the matrix element region must be reweighted with Sudakov form factors and running coupling constants 
to take these important effects into account. Without these corrections distributions will exhibit unphysical 
discontinuities across the phase space partition at ymerge- 

Both the matrix element correction methods and the ME-PS methods effectively take into account the 
real emission component of next-to-leading order corrections but not the virtual effects. Nevertheless, 
there are many examples in the literature (albeit for production processes) where these methods are shown 
to give excellent agreement with next-to-leading order calculations from the point of view of the shapes 
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of distributions | [T20l[T2Tl[T53]4T55l[242l 



16.4 Herwig++ 

• Finite width effects 

Finite width effects are present by default for all processes in Herwig++ through the inclusion 



of a weight factor as in Eq. (34i, retaining the dependence of the width on the off-shell mass of 
the particle, in both the production and decay stages. This approach is used in the simulation of 
cascade decays, where it was shown to give good agreement with results obtained using exact 
matrix elements in Ref. [[220J. 
Spin correlations 



The Herwig++ spin correlation algorithm is precisely that described in Section 16.32 and was 



implemented by the author of Ref. |224 1. 
QCD radiation 

The chief success of the older fortran Herwig 6 program was in its accounting of soft gluon 
interference effects, specifically, in particular colour coherence phenomena, through the angular 
ordering of emissions in the shower. The new Herwig++ algorithm retains angular ordering as 
a central feature of the showering algorithm and improves on it, most notably, in the present 
context, through the inclusion of the mass-dependent splitting functions and kinematics, providing 
a physical description of the radiation distribution emitted by massive particles in the low px 
region | 2T|243[ 244 1. The facility to model initial-state parton showers in the decays of unstable 
coloured particles was envisaged at the start of the Herwig++ project and has been implemented in 
full generality in 2006. 
QED radiation 

The Herwig++ program includes a module which dresses the decays of light and heavy objects. 



involving a single electric dipole, with QED radiation |245|. This simulation uses the YES 
formalism to resum the effects of multiple photon emissions, as well as a Poissonian shower 
algorithm following the methods of Jadach | 236[ . 
Hard radiation 

In implementing colour coherence through angular ordering the Herwig 6 and Herwig++ shower 
algorithms contain dead zones in the phase space for the hardest emission, where the hai^dest 
emission occurs at wide angles with respect to the parent(s). In the context of particle decays 
Herwig++ includes matrix element corrections for top quark decays as described in | 244[ . This 
serves to populate the dead zone for this process and correct the radiation pattern in the live zones. 
Matrix element corrections to W^, Z, and Higgs boson decays are not available, however, due 
to the neutrality of the decaying particle and the low mass of the negligible mass of their decay 
products, these may be implemented with quite a modest effort. 



16.5 Pythia 

• Finite width effects 



All particles that are defined with a width are distributed according to a Breit-Wigner, cf. eq. (34). 
In case of pure s-channel processes, such as e^e~ — )• 7*/^" — )• //, a running width is used, 
in agreement with LEPl conventions. For other processes normally a fixed width is used. The 
inclusion of this width in the matrix-element expressions, especially the interference terms, may 
vary from process to process — Pythia 8 does not have a matrix-element generator of its own, but 
encodes calculations made by many different authors. 
Spin correlations 

Pythia 8 does not come with a spin correlation algorithm, which means that the default is isotropic 
decays. For many processes and decays spin correlations are included with full matrix elements, 
however. That is, for a process such as e+e" — )• W^W~ — )• /1/2/3/4, first kinematics (including 
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masses) is selected for the e+e~ — )• W^W~ process, and thereafter the decay angles of the two 
W's are simultaneously sampled and weighted according to the complete 2 — )• 2 — )• 4 matrix 
elements. For cases where spin is important and not implemented, users are recommended to use 
external input from dedicated matrix-element programs, such as MadGraph | [8T| . 
QCD radiation 



The original Pythia 8 shower algorithm was mass-ordered |246|, with extra angular-ordering cuts. 



while the current one is transverse-momentum-ordered |69 1, with a dipole approach to handling 
recoil effects. One nice feature of these algorithms is that they cover all of the phase space, at 
least for the first emission, thus obviating the need for several of the special HERWIG tricks 
described above, e.g. to handle dead zones or to separate off space-like emissions in resonance 
decays. The modification of the evolution variable from dm?/m? to Am? /{m? — ttiq) gives a 



consistent coverage of the soft/collinear region m? — )■ ttiq for radiation off massive particles 1 247| 



Here rriQ is the rest mass or, for a resonance, the previously Breit-Wigner-sampled mass. This 
approach is also easily extended to the case of evolution in p\ = z{l — z)m?. 
• QED radiation 

The same shower algorithm that does QCD emissions also can handle QED ones, in an interleaved 
manner. That is, QCD and QED emissions can alternate in the downwards evolution in m? orp\, 
and thus compete for momentum. The QCD and QED dipoles do not need to agree. 



Optionally the PHOTOS package may be used |230-232|; by experimentalists often to handle 
QED corrections in hadronic resonance decays. 

• Hard radiation 

The Pythia shower can easily be fixed up to overestimate radiation in the hard region, while attaching 
to the correct expression in the soft one (see above). A Monte Carlo rejection factor can then be 
implemented to match the first gluon emission to the respective QCD matrix -element expression for 
essentially all decays in the SM and the MSSM | 247) . For QED such corrections are only included 
m 7 /Z° and decays. 

16.6 Sherpa 

• Finite width effects 

In Sherpa finite fixed widths are incorporated in the propagator denominators of the internal tree- 



level matrix element generators AMEGIC-i-i- 1 149 1 and COMIX [150J . The fixed width scheme is 
employed within these matrix element generators. 
Spin correlations 

In order to include spin correlations between production and decay Sherpa essentially employs the 
first algorithm described in the corresponding section above. The versatility of the implementation 
is enhanced by Sherpa's ability to automatically generate the matrix elements needed for the 
production and decay spin density matrices internally. Along this line, Sherpa allows on the level 
of hard matrix-element generation for the specification of the production and decay processes in a 
cascade-like manner. As a result contributions from non-resonant diagrams can be omitted while 
one preserves off-shell mass effects and spin correlations in the generation of the fully decayed final 
states. To incorporate spin correllations in the decays of hadrons and r-leptons the simple rejection 
algorithm outlined above is employed. 
QCD radiation 



The new generation of Sherpa versions 1.2.x uses the CSS parton-shower algorithm 1 89 1, which 
is based on (massive) Catani-Seymour dipole subtraction kernels and a dipole-like picture of 
shower evolution. Emissions are ordered in transverse momentum rather than virtuality. The CSS 
implements truncated showering, therefore handles final-state and initial-state emissions in the 
production and decay of unstable coloured particles, respectively. The Sherpa parton shower of 
versions 1.1.x and older, APACIC++ | 248| , is closely related to that of the Pythia virtuality ordered 
shower 11 81. 
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QED radiation 

Sherpa includes a native simulation of QED radiation in decays, PHOTONS++ |249|; like 
the Herwig++ QED shower this module is built upon the YES resummation formalism and the 
Poissonian shower algorithms of the Krakow group 1 236 -241 1. 
Hard radiation 

Here again Sherpa makes use of its internal matrix element generators to generate amplitudes for the 
leading order processes in the presence of additional QCD radiation. The key point is that processes 
of different parton multiplicity in the final state can be combined consistently with parton showers 
(generated by the CSS algorithm) and hadronization included according to the new strategy of 
matrix-element and truncated-shower merging (ME&TS) |65 151 152 1. This approach improves 
over the CKKW method, implemented in versions 1.1.x and older, since it guarantees an unified 
treatment of local scales in the calculation of the matrix elements and parton showers. Hence, one 
finds that the systematic uncertainties of the ME&TS merging are sizeably reduced with respect to 
CKKW. Using the ME&TS matching, extra hard radiation can be reliably described in multi-jet and 
high-pT scenarios. For example, Sheipa employs the new method to include exact real emission 
corrections to the tt production process, as well as the t and t decay processes, in the region of 
phase space above the ME&TS merging scale. 
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17. SENSITIVITY OF QCD JET MASS AND JET SUBSTRUCTURE TO PILE-UP AT LHCjf] 
17.1 Introduction 

New methods to measure the hadronic decays of boosted massive particles at the LHC, Uke W bosons 
p82l , top quarks (e.g., fT85|p6l[250l ), Higgs boson p38l , and BSM particles (e.g., | [T83| ), have been 



proposed in recent years. Due to the boost of these massive particles, the decay products are collimated 
in one direction, and they are reconstructed as one single jet. The properties of these jets, such as 
their masses and jet sub-structure quantities, can be used to separate them from the standard QCD jets. 
Several strategies have been developed to distinguish these two sources and to study the properties of the 



corresponding jets, including the use of jet algorithms with variable size parameters 1 205 1 , jet pruning, 
see e.g. ]193[ , and jet trimming |194| |. 



In this study we investigate the impact of the pile-up on the measurement of these observables. The 
presence of the pile-up interactions affect the low transverse momentum (px) spectrum. We expect that it 
strongly affects the jet energy calibration in the very low px region, becoming negligible for high px jets. 
The estimate of the effects on the jet mass and on the the jet-substructure quantities is less straightforward 
because these quantities are complex QCD observables. 

Another important aspect is the capability to properly reconstruct and calibrate the jets. There are 
several approaches to assess these points. Most of them are finalized at the calibration of the jet as a 
single global object starting from calorimetric signals. In this case, the capability to properly calibrate 
every component (i.e. calorimetric signals) of the jet is not straightforward and the precision on the 
measurement of the substructure should be investigated in a detailed analysis of the detector performance. 
This analysis is beyond our goal. 

In this study only the effects of the pile-up and one simple strategy to suppress it are investigated. 
In order to separate these effects from the detector calibration, we reconstructed jet from Monte Carlo 
generated particles (hadrons). The selection cuts, used to suppress the pile-up, are driven by consideration 
of the detector effects, especially for the reconstruction of low px particles, i.e. the effect of the magnetic 
field bending, energy losses in inactive material, and of the signal selection. 

17.2 Jet reconstruction, observables and selection cuts 

In this study we used a sample of QCD di-jet events in proton-proton collisions with center of mass energy 



10 TeV simulated with Pythia 8.120 1 18 20 1. The generation was divided in different samples 



according to the following bins (GeV) in pr. [17,34], [34,70], [70,140], [140,280], [280,560], [560,1 120], 
> 1120. For the first six samples, 500,000 events were simulated, for the highest px sample a simulation 
of 200,000 events was used. 

The effect of the pile-up is taken in account by adding four (Poisson distributed) additional proton- 
proton interactions per event. These interactions were simulated using the parameter "SoftQCD:aU" in 
Pythia 8.120, i.e. they include elastic, single diffractive, double diffractive scattering and minimum bias 
interactions. 

All the particles generated by the hard scatter and by the pile-up are used to build the final jets. The jet 
algorithm used for this purpose is the anti-fc_L [178] algorithm with distance parameter D = 0.4. Only the 
leading jet is used in this study. It is required to point into the pseudo-rapidity range —3.2 < t] < 3.2. 

The observables under consideration are the jet px, its mass mjet, and the y-scale yscaie 1 182| associated 



with the last clustering step in the formation of the jet. For the Anti-Zcj^ jet algorithm, this last observable 



is not directly available, as it only has a meaningful definition for the k± |251 252 1 jet algorithm. We 
therefore re-clustered the constituent particles of the Anti-fcj^ jet using the k± algorithm, thus allowing us 
to calculate its yscaie- 

The jet px, m^ct, and yscaic are then compared jet-by-jet to the corresponding observables in jets 



^^Contributed by: P. Loch, P. Francavilla 
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which have been reconstructed using only the particles generated by the hard scattering process. Naming 
Ohs+pu the observables in the presence of the pile-up and Ous the observables with only the hard scatter 
particles, the ratio Ohs+pu/Ohs is shown in the following plots as a function of the number of pile-up 
interactions (A^vtx) generated in the event. To properly study these ratios, we should start the jet clustering 
using only the particles generated by the hard scatter. In this paper we report a different way of defining 
the jets. We use all particles (hard scatter and pile-up) to reconstruct the jets and then we look at the 
constituents of the leading jet. From the list of constituents, we use the particles generated by the hard 
scatter to build the reference jet (and to calculate Ors)- This final reference jet is different from the jet 
reconstructed using only the hard scatter particles, but due to the peculiarities of the Anti-fc^ jet algorithm, 
we can expect a rather small difference introduced by this method. 

The observables Ohs+pu depend on the number of pile-up interactions. To suppress the effect of the 
pile-up, which usually adds a set of low px particles to the hard scatter, we used a threshold on the particle 
Pt, thus eliminating low pT particles. This method to suppress the effect of the pile-up is approximate for 
jets reconstructed in the calorimeter, but gives us an estimate of how a selection on the constituents of the 
jets, aimed to suppress contributions from pile-up, could change the dependence of the observables as a 
function of the number of pile-up interactions. We applied four different px thresholds, namely: none, 0.5 
GeV, 1 GeV, and 2 GeV. The order of magnitude of this selection is similar to the effective cut-off that 
affect the charged particles in reaching the calorimeter due to the bending of the magnetic field, as in the 
ATLAS detector | 253| at LHC, and the typical minimum energy required in the central region to generate 



a signal in the calorimeter at all (px = 0.4 GeV). 

These cuts are applied in two ways. The goal of the first one is the estimate of the effect of a pile-up 
suppression after the jet reconstruction. For this purpose the cut is applied to the list of constituents of the 
jet, after its reconstruction. The second way emulates a signal selection prior of the jet reconstruction in 
that the cut is apphed before the jet reconstruction. We refer to the first method as an inclusive selection 
and to the second as the exclusive selection. Due to the difference in the clustering, the reference is 
different in the two cases. 

In the following section the results for the different observables are shown. 

17.3 Results 

17.31 Inclusive selection 

In this section we show some of the results obtained using the inclusive selection discussed above. The 
results are divided in different plots according to the windows in for the generation of the event. 



In Fig 49 the dependence of Pt,hs+pu/pt,hs as a function of number of pile-up interaction (A'vtx) is 
shown. The different colors show the dependence for the selection cuts used in the analysis according 
to the following convention: red for none, blue for 0.5 GeV, green for 1 GeV and black for 2 GeV. The 
different plots show that the variation as a function of A'vtxis smaller for bigger values of the threshold, 
but introduces a bias especially for low px (17-34 GeV). The 0.5 GeV and 1 GeV thresholds seem to be 
the more appropriate, because they reduce the variation on the number of pile-up interactions to less than 
1% while introducing a bias smaller than 1% in the jet energy scale for jets with px > 280 GeV. The 2 
GeV threshold suppresses the dependence on A^vtx> but the bias is bigger than 1% in some region of px 
and a corresponding correction of the jet energy scale would be needed to reach an accurate calibration. 



In Fig 50 the variations of the jet mass mjct are shown in various kinematic bins. These plots show 



similarities with respect to the figure 49 Even in this case, a selection of the constituents reduces the 
impact of the pile-up interactions. The variation due to pile-up in absence of any constituent px thresholds 
cuts is of the order of 20% even for the high px jets, indicating the need for a selection. Again, thresholds 
of 0.5 GeV and 1 GeV are the most appropriate, reducing the pile-up dependence to 10% and 5%, 
respectively, while introducing a respective bias of the order of 5% and 10%. The threshold of 2 GeV, 
which reduces significantly the pile-up variation from 20% to almost none, introduces a 20% bias. 
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Fig. 49: Inclusive selection: comparison of the ratios of Apx/pT = (pt,hs+pu — Pt,hs)/pt,hs as 
a function of the number of pile-up interactions for different windows of px- Here red indicates no 
constituent pT threshold, while blue/green/black indicate thresholds of 0.5/1.0/2.0 GeV, respectively. 



A similar behavior is shown in figure 51 for yscaic- In this case, the 1 GeV threshold shows a good 
suppression of the pile-up dependence, together with a bias of less than 5%. 



17.32 Exclusive selection 

In the case of the exclusive selection, a particle selection cut using the pr thresholds is applied prior to jet 
reconstruction. In this case, the constituents differ from the inclusive selection even for the reference. The 
purpose of this part is the evaluation of an intrinsic selection due to instrumental effects, such as already 
mentioned bending of charged particle tracks in the magnetic field and the effect of inactive material in 
front of the calorimeter As for the inclusive selection, we expect a dependence of the observables on the 
number of pile-up interactions A'vtx- This dependence is reduced by applying the pT based selections. 



A comparison of the inclusive and exclusive selection is shown in Fig 52 This plot shows the ratio 
Pt,hs+pu/pt,hs for the two selections for different px thresholds. Suppressing the low px particles 
already in the reference jet, the bias is no more visible, and only the residual dependence on the pile-up 
activity is shown. This dependence is smaller than the dependence shown in the inclusive plot. 



The results for the jet mass mjet and the y-scale yscaie are shown in Fig. 53 These four plots show that 
the variations of mjet and yscaie are very similar, and 1 GeV or bigger pT threshold applied to the final 
state particles is useful to reduce this dependence. 



17.4 Conclusion 

The studies described in this paper show the dependence of some of the jet substructure observables on 
the pile-up activities. The variation in the jet transverse momentum is less prominent for high pT jets, as 
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Fig. 50: Inclusive selection: comparison of the ratios of Amjet /?7ijet = (w-jet.HS+PU — ''T^jet,Hs)/"^jet,HS 
as a function of A'vtx^ for different windows of pT- Again, red indicates no threshold, while 
blue/green/black indicate jet constituent thresholds of 0.5/1.0/2.0 GeV, respectively. 



expected. In addition, a px threshold cut on the constituents seems to be useful to accurately calibrate the 
jet energy scale for high px jets. 

For the jet mass and the y-scale, the variation introduced by the pile-up is of the order of 2% per pile-up 
interaction. If we consider an activity of between to 10 pile-up interactions per signal event, the total 
variation is of the order of 20%, even for high pT jets. In this case a pile-up suppression is needed and 
applying a pT threshold on the particles seems to reduce the dependence to 5-10% while introducing a 
small bias of 5-10%. 

The effect of the pile-up has to be compared with the experimental capability to reconstruct and calibrate 
the observables under investigation. This part is beyond the goal of this paper and it could be evaluated in 
a realistic detector simulation, which takes in account all the detector effects. The effect of the pile-up and 
its suppression is an important ingredient to be taken into account when using complex QCD observables 
like jet masses and substructure in an LHC analyses. 
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Fig. 51: Inclusive selection: comparison of the ratios of Ayscaie/yscaie = (2/scaie,HS+PU - 
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Fig. 52: Comparison of the effect of exclusive and inclusive selections: the ratios of Apx/pr = 
(pt,hs+pu — Pt,hs) /pt,hs are shown as a function of the number of pile-up interactions A^vtx- The red 
points indicate no threshold, while blue/green/black correspond to 05/1.0/2.0 GeV thresholds, respectively. 
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18. A STUDY OF RADIATION BETWEEN JETS AT THE LHC E] 
18.1 INTRODUCTION 

The pattern of radiation between jets (or between jets and the proton remnant) ought to be very different 
depending upon whether the exchanged state is a colour singlet or colour octet object. Particles are 
expected to flow differently in (p regions both close to and far from the jets. The typical — scale of the 
radiation patterns is also relevant. Traditionally, either due to uncertainties in the theoretical predictions 
or because of the poor granularity of calorimeters used in hadron colliders, few of the fine details of 
radiation have been studied. Events generated by colour singlet or colour octet exchange have therefore 
been distinguished in the past by applying either a cut on the total sum of transverse momentum between 
the two jets | |254 |, or on the transverse momentum of a third jet between the two flSSI. The presence 



of Underlying Event (UE) and especially of pile-up in hadron collisions will give rise to a very small 
efficiency for finding events with a clean gap between the jets. The efficiency may also be very sensitive 
to the details of the specific models. Radiation between jets has to be studied in data, using the most 
powerful techniques to separate the various contributions, and possibly reduce the effect of UE or pile-up. 
Here we present three different classes of observables that can be used for such studies: the distribution of 
radiation outside jets as a function of distance from the jet axis, using different jet algorithms; the "gap 
grid" method, consisting of dividing the detector in several parts along rj and $ to exploit the different 
space distribution of QCD radiation; and the application of a one-dimensional Fourier transformation to 
the energy deposition in the event to highlight specific structures characterised by a given size. 

18.2 RADIATION DISTRIBUTION CLOSE TO THE JET BOUNDARIES 



As a first method, we studied the radiation distribution from events generated with Herwig 6 1 17 1 close to 



the jet boundaries, for various jet algorithms. This shows how the various algorithms differ for average 



QCD events at the LHC. Figure 54 shows the Et-weighted distribution of the Ai? distance of truth-level 
particles from the axis of the leading jet. Using different algorithms with radius (or measure) 0.4, figure 
[54^ shows the distribution of particles belonging to the jet, while 54?) shows particles outside of the jet. 



The Anti-A;^ algorithm shows a clear cut-off at the nominal value of the jet radius, as expected by its 
definition, while the other algorithms have a smoother behaviour. In particular the k_i_ algorithm has the 
largest amount of radiation outside the nominal jet size. 




Fig. 54: Energy flow for QCD colour octet events at a distance Ai? from the jet axis using a) particles 
that are constituents of the jet and b) particles not part of the jet. 



Figure 55 shows the same plots for events in which a colour singlet is exchanged between the two 
leading paitons. Note that whilst 55 1 is broadly similar to 54 1, 55 5 shows less than half the activity of 
54)3 for particles that are not part of the jet. Figure [55^ in fact shows a slightly less sharp jet border at 



R=0.4 for colour singlet exchange because the generally lower amount of radiation in colour singlet events 
means that, in order to pass the jet Et cut, radiation must be drawn into the jet from a wider area. These 
characteristics will be exploited in the techniques described in the next sections. 



Contributed by: M. Campanelli, J. Monk, J. Robinson, C. Taylor 



114 




18.3 GAP GRID TECHNIQUE 

A method often used to distinguish events due to colour singlet and colour octet exchange is looking at 
the total transverse energy in the rapidity range between two jets. Events due to colour-singlet exchange 
(no colour flows between the jets) are characterised by almost no radiation, their golden signature being 
an empty rapidity gap between the two jets. 

It is easy to understand why this quantity is too inclusive to be optimal. Hard radiation from the jet 
itself can emit particles outside the jet boundaries independently of the type of event. On the other hand, 
particles emitted by a colour connection between the jets tend to follow specific geometrical distributions, 
for instance being quite uniform in rapidity and close in (j) to the main jets. To study these differences, we 
divide the detector into various regions according to the directions of the two jets with largest Et. First, 
the area with | A?7| < 0.7 with respect to the axis of the first two jets is removed. The remaining area 
between the jets is divided into 16 regions of equal size in the rj — (p plane: four intervals of equal Ar] 
(each one quarter of the eta difference between the two main jets minus 2 * 0.7) and four of equal A<I> 
(each of 90 degrees), with the <l> = angle defined as the region of the leading jet in the event, and the 
first region being comprised between —45° and 45°. The two detector areas between the jets and the 
beam pipe are also divided in four regions each, according to the same A<I> convention. 

The convention is that the regions between the two jets are labeled between 1 and 16, with regions 1 to 
4 being the closest in eta to the main jet (and ordered such that region 1 contains = 0, region 3 <I> = vr, 
region 2 is the transverse one closer to the axis of the second jet and region 4 the remaining one) 5 to 8 for 
the next r] region in the direction of the second jet, with the same convention with respect to A<I>, etc. 



The average transverse energy deposition in each of the detector areas is shown in Fig. 56 1) for Herwig 6 
colour singlet and colour octet events, the former shown with the Jimmy UE model |54| switched both 
on and off. A clear difference between the various cases is visible, but what is more important is that 
this difference is more marked in some regions than in others. For instance, and despite the cut at it 0.7, 
all three event classes show that both rj regions close to the jets receive more radiation than the central 
regions. This is due to the splash-out from the jets themselves; the two regions with similar to the main 
jets also have more radiation - an effect that is more pronounced for the events with colour octet exchange. 
Dividing the rapidity gap between the two main jets into several smaller regions not only allows a better 
understanding of the radiation patterns (for instance, to compare with different Monte Carlo models), but 
also provides a more powerful separation between singlets and octets. 

The presence of UE is a strong nuisance factor: without it some regions would almost be completely 
devoid of radiation in the case of colour singlet exchange, which would make the identification of such 
events very easy. Although there is no obvious way to identify radiation due to the UE (it is not even 
a fully well-defined concept), it is on average softer than the main scattering process. A possibility to 
exploit this fact would simply be to impose a high threshold for clusters to be included in the density 
calculation; however, it was shown in ||256} that a better approach would be to establish this threshold 
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Fig. 56: The average Et density in 24 different rj — (p regions. Regions 1 to 16 lie between the rj values 
of the leading two jets. Jets with colour octet exchange and UE present are shown as black dashed crosses. 
Jets with colour singlet exchange are shown in red; solid data-points are events with UE present and dotted 
data-points have UE turned off. a) on the left shows the E-r density uncorrected for UE, b) on the right 
shows the effect of applying a correction for UE based upon the area of a jet and the average Et density 
per event. 



on an event-by-event basis. Following this idea, we use the k± algorithm with a small Et threshold to 
cluster into jets all energy depositions in the events. For each jet we calculate the active area, then define 
an Et density as the ratio of the jet transverse momentum and its area. All jets below 10 GeV per unit 
of rapidity squared are considered potential soft candidates; their average density on an event-by-event 
basis is taken as the value of the soft density for the event. For all jets in the event an UE contribution 
is calculated as the product of this soft density and the area of each jet, and subtracted from the jet 
transverse momentum. Finally, all jets whose transverse momentum after this subtraction is below 3 GeV 
are considered as coming from a soft UE-type interaction and are therefore removed. The advantage 
of this method with respect to applying a strong cut on all jets is that the amount of soft radiation is 
determined on an event-by-event basis, so coherent upward or downward fluctuations of the soft activity 
can be accounted for. After this procedure, the density distribution for the various detector regions is 
shown in Fig 565). An improvement with respect to the previous case is visible, with the additional 
advantage of a smaller dependence on the MonteCarlo description of the soft interactions. 



18.4 FOURIER TRANSFORMATION 

The problem of separating the colour connection effect from the UE or pile-up is one of separating features 
of differing physical size in the event. The UE fills the whole detector with radiation at all rj. Colour 
connection effects between jets are typically smaller than this size, i.e. are approximately the size of the 
jet-jet or jet-beamline interval. Hadronisation and showering effects can be expected to be of a similar to 
smaller size. The hard jets will be smaller still, with a radius of i? ~ 0.5 and there may be jets originating 
from softer partons with R as small as 0.1. 

A Fourier-transformation of the spatial distribution of radiation in the event could separate the various 
scales, allowing an alternative reading of the energy flow. For the 1 -dimensional case described here, we 
use the <I> distribution of the truth-level particles for each event, weighted by transverse energy summed 
in 32 bins covering the region O-vr. The axis of the hardest jet is taken as as the $ = direction. We 
then extract and use as our variables the first 32 complex coefficients of the Fourier expansion of this 
distribution. Since these coefficients are linked by the relation Cn = Cn-u (where N=32), there are only 
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32 independent real coefficients, indicating no information is lost (or gained!) on the event-by-event basis 
in going from the 32 bins to the 32 coefficients. 

To test the effectiveness of this method, we applied it to the problem of separating colour singlet 
and octet exchange in hadronic collisions. Colour singlet exchanges without UE are possibly the most 
di-jet-like kind of events in a hadron collider. Since the <l> = direction is aligned to the axis of the 
leading jet, most of the remaining radiation will be concentrated around $ = vr, which is the most likely 
location of the second jet. It is easy to show that odd coefficients can never have a peak at both and vr. 
Further, the n*'* coefficient corresponds to correlated activity of size r ~ vr/n. Small-n odd coefficients, 
corresponding as they do to large non-di-jet features, are therefore expected to be suppressed in perfect 



di-jet events. This is exactly what is observed in Fig. 57 where the average magnitude of the various 
coefficients is plotted for the two extreme cases of a set of colour-singlet exchange without the UE, and 
colour octet exchange with UE. 
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Fig. 57: The average magnitude of the first 16 Fourier coefficients for colour singlet without UE (red- 
dashed) and colour octet with UE (black-solid). 



The suppression of the odd coefficients is clearly visible in the first case, and is much less pronounced 
(even if still present, since the events nevertheless contain di-jets) for the colour octet exchange case. 
Although there is also a difference between the even coefficients (especially at lower-n), it is much less 
pronounced than for the odd coefficients. In particular, we expect coefficients 3 and 5 to originate from 
structures of a similar size (tt/S ~ 1) to the inter-jet radiation. Adding the UE does not change this picture 
significantly, since the uniform UE radiation will only influence the magnitude of coefficient zero, leaving 
the mean values of the others almost unchanged. This is shown more clearly in Fig 58 which shows the 
distribution of the magnitude of the 3rd coefficient for colour singlet and colour octet events. There is a 
clear difference between the two types of event and the presence or absence of UE makes little difference 
to the shape of the distribution from colour singlet events. 

Another potentially interesting variable to look at is the number of the largest coefficient for each event, 
which shows the scale of the dominant feature in the event. Figure 59 shows the distribution of the largest 
imaginary coefficient in the event, for the two classes of colour exchange, both with UE on. 

While for ordinary QCD events this distribution is peaked at low values, this is not the case for colour 
singlet events, where there is a peak around coefficient 8 that corresponds to radiation concentrations of 
size 7r/8 ~ 0.4, roughly the size of a jet. Further, in the colour singlet sample there is a relative depletion 
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Fig. 58: Distribution of the magnitude of the 3rd coefficient for colour singlet without UE (red-dotted), 
colour singlet with UE (red-solid) and colour octet with UE (black-dashed). The addition of UE makes 
little difference to the colour singlet distributions, whilst there is a clear difference between colour octet 
and colour singlet. 



of events in which the largest coefficient was either 3 or 5. More details on the Fourier transformation 
method can be found in ||257j . 

CONCLUSIONS 

Radiation between jets is a key topic for understanding the interplay between hard and soft QCD, and will 
be the object of many measurements with the first LHC data. Here we propose new methods to study 
this radiation in more detail than has been done in the past. The first measurements of these observables 
will offer interesting handles on the discrimination of the various theoretical models. To show the strong 
and weak points of the methods we applied them to the problem of separation between colour singlet and 
colour octet exchange events, but the techniques presented here are more general, and can be applied to 
most of the fuUy-hadronic final states produced by the first LHC data. 
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Fig. 59: The largest imaginary coefficient in the event. Solid black shows colour-octet exchange events 
and dotted-red shows colour singlet exchange. Both event samples have the underlying event model turned 
on. The colour octet sample shows a peak towards lower n, indicating more events have their inter-jet 
activity dominated by features that are large inr] — (p. The colour singlet sample, on the other hand, shows 
more events whose inter-jet radiation is dominated by features of size r ~ S/tt. 
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Part VI 

BEYOND THE STANDARD MODEL 

19. AN UPDATE OF THE PROGRAM HDECA\0 
19.1 INTRODUCTION 

The search strategies for Higgs bosons at LEP, Tevatron, LHC and future e+e^ Unear colliders (LC) 
exploit various Higgs boson decay channels. The strategies depend not only on the experimental setup 
(hadron versus lepton colliders) but also on the theoretical scenarios: the Standard Model (SM) or some of 
its extensions such as the Minimal Supersymmetric Standard Model (MSSM). It is of vital importance to 
have reliable predictions for the branching ratios of the Higgs boson decays for these theoretical models. 

The current version of the program HDECAY |258l can be used to calculate Higgs boson partial decay 
widths and branching ratios within the SM and the MSSM and includes: 

- All decay channels that are kinematically allowed and which have branching ratios larger than 10^^, 
i.e. the loop mediated, the three body decay modes and in the MSSM the cascade and the supersymmetric 



decay channels |111 259 260 1. 



All relevant higher-order QCD corrections to the decays into quark pairs and to the loop mediated decays 



into gluons are incorporated |261 1. 



- Double off-shell decays of the CP-even Higgs bosons into massive gauge bosons which then decay into 
four massless fermions, and all important below-threshold three-body decays | 262[ . 



- In the MSSM, the complete radiative corrections in the effective potential approach with full mixing 
in the stop/sbottom sectors; it uses the renormalization group improved values of the Higgs masses and 
couplings and the relevant next-to-leading-order corrections are implemented | |263 -f266 ] . 



- In the MSSM, all the decays into SUSY particles (neutralinos, charginos, sleptons and squarks including 
mixing in the stop, sbottom and stau sectors) when they are kinematically allowed 1 267 - 269) . The SUSY 
particles are also included in the loop mediated 77 and gg decay channels. 

The source code of the program, written in FORTRAN, has been tested on computers running under 
different operating systems. The program provides a very flexible and convenient use, fitting to all options 
of phenomenological relevance. The basic input parameters, fermion and gauge boson masses and their 
total widths, coupling constants and, in the MSSM, soft SUSY-breaking parameters can be chosen from 
an input file. In this file several flags allow switching on/off or changing some options [e.g. choosing 
a paiticulai" Higgs boson, including/excluding the multi-body or SUSY decays, or including/excluding 
specific higher-order QCD corrections]. 

19.2 UPDATES 

Since the release of the original version of the program several bugs have been fixed and a number of 
improvements and new theoretical calculations have been implemented. The following points summarize 
the most important modifications of HDECAY after its release: 



- Implementation of Higgs boson decays to gravitino + gaugino |270|. A flag in the input file controls 
whether these decay modes are taken into account or not. 



- Inclusion of SUSY-QCD corrections in neutral MSSM Higgs decays to 66 1 27 1 272) and resummation 



of Aft effects | 273[ 274) up to NNLO |275 276 1. The corresponding Aft terms have also been included in 



charged Higgs decays — )• tb. 

Contributed by: A. Djouadi, J. Kalinowski, M. MUhlleitner and M. Spira 
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- Determination and inclusion of the RG improved two-loop contributions to the MSSM Higgs self- 
interactions. These two-loop corrections extend the results of Ref. | 263[ for the stop and sbottom 
contributions for arbitrary mixing parameters and mass splitting, where they have only been displayed 
for the scalar CP-even Higgs mass matrix. We have checked explicitely that we reproduce the results of 
Ref. 12631. 



- An interface to the SUSY Les Houches Accord (SLHA) |43 44 1 has been implemented properly. This 
required several transfoiTnations of the corresponding renormalization schemes to the ones used by 
HDECAY. This option can be switched on and off by appropriate flags in the input file. The output file in 
the SLHA format can also be used again as input file. Moreover, the automatic generation of the SLHA 
input file according to the input parameters of HDECAY is provided as an option. 

- Inclusion of the full mass dependence of the NLO QCD corrections to the quark and squark loop 
contributions to photonic Higgs decays (/>—)• 77 |277 -282|. This decay width can now also be used to 
determine the production cross sections of Higgs bosons at photon colliders at NLO QCD. 

- Inclusion of electroweak corrections to the SM Higgs boson decays H — ^ 1^(*)vf(*) /z^*'>Z^*'^ — ^ 4/ 
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Fig. 60: Ratio of the partial widths of H 



4/ from HDECAY and 



PR0PHECY4F |283 284]. 



in approximate form which reproduces the full results of Refs. |283 284 1 within 1% as can be inferred 
from Fig. 60 where the ratios of the partial decay widths into W^W^ and Z^*^Z^*'> are shown as a 



function of the Standard Model Higgs mass. The improved Born approximation of Ref. 1 283] 284 1 has 
been implemented with additional improvements at the WW and ZZ thresholds. 



The logbook of all modifications and the most recent version of the program can be found on the web 
page http : //people . web . psi . ch/ spira/proglist . html. 
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20. IMPLEMENTATION AND VALIDATION OF MODELS BEYOND THE STANDARD MODEL 

WITH FEYNRULESE!] 
20.1 INTRODUCTION 

Monte Carlo event generators play an important role in making reliable predictions for the events to be 
observed at collider experiments, both to describe the backgrounds and possible candidate signals. In 
particular, the simulation of a hadronic collision requires not only an accurate description of the underlying 
hard scattering process, but also of subsequent parton showering and hadronization as efficiently provided 
by programs such as Herwig ]21|136|, Pythia p^pO| and Sherpa | |22|23) . As regards the generation of 
the hard matrix element itself, a lot of effort has gone into the development of several multipurpose matrix 
element generators, such as CompHep/CalcHep [ |285f|287 ], MadGraph/MadEvent [ 80l81|288|289| , 



Sherpa and Whizard [ |211[|290J . Even though these programs are in principle able to generate the 



(tree-level) matrix element for any process in the framework of a given renonnalizable quantum field 
theory built on scalar, vector and fermion fields, the implementation of a full Beyond Standard Model 
(BSM) theory can be a tedious and error-prone task, often requiring the implementation of one vertex at 
the time following the conventions specific to each code. 

In this section a summary report on the implementation and validation of BSM models into multi- 
purpose matrix element generators is presented. The starting point of our approach is FeynRules | 291| , 



a MATHEMATlCA(gp^package that allows to compute Feynman rules from a Lagrangian in an automated 
way. Furthermore, FeynRules contains a set of interfaces to various matrix element generators, 
allowing to implement the model into a given tool in an automated way. For the moment, interfaces to 
CompHep/CalcHep, FeynArts/FormCalc |292ff297| , MadGraph/MadEvent and Sherpa are 
availablePl A first step in the direction of deriving Feynman rules automatically starting from a model 



Lagrangian has been made in the context of the CompHep/CalcHep event generator with the LanHep 



package |298 1. Our aim is to go beyond this scheme and create a general and flexible environment where 
communication between theorists and experimentalists in both directions is fast and robust. First, the use 
of Mathematica as a working environment provides a powerful and user-friendly platform where BSM 
models can be developed and implemented. Second, the possibility to export the model to more than one 
matrix element generator enhances the chances that the model can be successfully dealt with by more 
than one code. Furthermore, since many of these matrix element generators are already embedded in 
the experimental softwares, the new models can easily be integrated into the experimental framework, 
allowing in this way for an efficient communication between theorists and the experimental community. 



As the starting point of our approach is FeynRules, we briefly recall its basic features in Section 20.2 



and discuss recent developments triggered by the activities and the discussions at the Les Houches 



workshop. In Section 20.3 we present a proposal of how new BSM models can be easily validated 



exploiting the fact that once a FeynRules implementation is available, the model can easily be exported 



to various matrix element generators. In Section 20.4 we discuss the implementation of several new 



models that were implemented and/or validated as an outcome of the workshop. 



20.2 RECENT DEVELOPMENTS IN FEYNRULES 

FeynRules is a Mathematica package that allows to compute Feynman rules directly from a La- 
grangian in an automated way. The user provides the Lagrangian for the model (written in Mathematica) 
as well as all the information about the particle content and the parameters of the model. This information 
uniquely defines the model, and hence is enough to derive all the interaction vertices from the Lagrangian. 
FeynRules can in principle be used with any model which fulfills basic quantum field theoretical 
requirements (e.g., Lorentz and gauge invariance), the only limitation coming from the kinds of fields 

^^Contributed by: L. Basso, F. Braam, C. Duhr, B. Fuks, A. Martin, J. Renter, T. Roy, S. Schumann 
^^Mathematica is a registered trademark of Wolfram Research, Inc. 
^^An interface to WHIZARD will be available in the near future. 
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supported by FeynRules. As to date, the public release of FeynRules supports scalar, vector, fermion 
(Dirac and Majorana) and spin-two fields, as well as Faddeev-Poppov ghosts, and very recently, also 
Weyl fermions have been includecj^ as will be described at the end of this section. In a second step, the 
interaction vertices obtained by FeynRules can be exported to various matrix element generators by 
means of interfaces provided by the package ['2991. Let us note that even though FeynRules itself can 
in principle be used to obtain the Feynman rules for any Lagrangian, the matrix element generators very 
often have certain information on the color and/or Lorentz structures hardcoded. In this case the interfaces 
check whether all the vertices are compliant with the structures supported by the corresponding matrix 
element generator, and if not, a warning is printed and the vertex is discarded. Each interface produces at 
the end a (set of) text file(s), often consistently organized in a single directory, which can be read into the 
matrix element generator at runtime and allows to use the new model in a way similar to all other built-in 
models. 

As already mentioned, the most important recent development concerns the possibility to write La- 
grangians in terms of two-component Weyl fermions. In the following we give a brief description how 
this new feature can be used inside FeynRules. Weyl fermions can be declared in the model file in 
exactly the same way as other particle classes, e.g., 

W[l] == { 

ClassName -> chi, 

Chirality -> Left, 

Self Con jugate -> False}, 



W[2] == { 

ClassName -> xibar, 

Chirality -> Right, 

Self Con jugate -> False}. 

The particle class W refers to Weyl fermions, and we have defined in this way one left-handed and one 
right-handed Weyl fermions named chi and xi, respectively. Let us note that all other options allowed 
for fields are also available for Weyl fermions (See the FeynRules manual [300]). Along the same lines, 
there is a new option for Dirac fermions specifying their left and right-handed Weyl components, e.g., 

F[l] == { 

ClassName -> psi. 

Self Con jugate -> False, 

WeylComponents -> {chi, xibar}}. 

More details on the use of Weyl fermions in FeynRules will be presented in a forthcoming publication. 
Here it suffices to say that since matrix element generators work with four-component spinors rather than 
Weyl fermions, FeynRules replaces at runtime all Weyl fermions by the corresponding Dirac fermion 
V' = (X) 0^ according to the prescription, 

x^^^^, i^^^r, i^A^^. x^A^r, 

where ip'^ = C = x)^ denotes the charge conjugated Dirac spinor. After these replacements, the 
Lagrangian is expressed completely in terms of four-component Dirac and Majorana spinors, and the 
Feynman rules can be exported to the matrix element generators in the standard form. 

The two-component formalism we have just described can be extremely useful when dealing with 
supersymmetric extensions of the SM, where Weyl fermions appear as the natural fermionic degrees 

^^This feature is still beta and will be made public with the next major release. 
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of freedom of chiral and gauge superfields. Let us note that this is just the first step of a more general 
project which aims at implementing superfields directly into FeynRules. In particular, we applied 
the Weyl-fermion formalism to rewrite the existing MSSM implementation in FeynRules | |299[|300[ 



completely in terms of two-component spinors, as well as to the implementation of the minimal R- 



symmetric supersymmetric extension of the SM described in Section 20.43 The validation of these (and 



other) new model implementations was done following the procedure described in the next section. 
20.3 VALIDATION PROCEDURE FOR BSM MODELS 

The requirements for a BSM model implementation that ensure a fast and efficient communication 
between all the actors involved in the simulation chain, both on the experimental and theoretical side, are 
manifold. The first obvious requirement is that the implementation produces reliable results when used 
with a matrix element generator, both from the perspective of the theoretical consistency of the model 
{e.g., gauge in variance,...), as well as regarding the technical aspects of the matrix element generator 
under consideration {e.g., conventions used in the code,...). A robust validation procedure must hence 
combine these two aspects, which however rely on two disconnected fields of expertise: if theoretical 
consistency is purely related to the physics content of the model, testing the technical aspects of the 
implementation requires for example detailed knowledge of the programming language used in the matrix 
element generators. Second, to ensure traceability both of the implemented models as well as of the 
generated event samples, every implementation of a BSM model into a matrix element generator must be 
clearly documented so that all the information about the physics content of the model and the choice of 
the parameters can be retraced at any point in the chain. 

In Refs. | 210[|299 |, several BSM implementation were validated by comparing the results obtained with 



different matrix element generators among themselves. In this section we propose an evolution of this 
strategy for the validation of a generic BSM model implementation, and we argue that the FeynRules 
approach offers a natural framework where BSM models cannot only be easily developed and implemented, 
but can also be validated to an unprecedented level. Since the FeynRules model files are built around 
the Lagrangian of the model, all the information about the physics content of the implementation is 
centered in one place and can be accessed at any point in the simulation chain, without being obscured by 
the conventions imposed by a specific matrix element generator. Furthermore, one can exploit the fact 
that FeynRules offers the possibility to implement the model into various generators and that different 
codes use different conventions and/or work in different gauges. In this way one can very easily check the 
sanity of an implementation by cross-checking the results obtained with different codes. For example, 
once a FeynRules implementation is available, the Feynman rules can be obtained automatically and 
then used to cross-check analytical results for simple processes against known results from the literature, 
a step that could even be automatized by using the FeynRules interface to FeynArts/FormCalc. 
After the model has passed successfully all of these tests, a more detailed study of the implementation 
can be performed by comparing the results of different matrix element generators for a large number 
of (tree-level) processes. In this way, it is not only very easy to show that the implementation produces 
reliable results for all matrix element generators involved, but it also provides a very natural and fast way 
to test the theoretical consistency of the model, by computing the results in different gauges, testing the 
high-energy behavior of certain processes, etc. 

To quantify the level to which a given BSM implementation has been validated, we propose in the 
following a four-step procedure to rate BSM model implementations regarding their validation. In this 
scheme, every implementation is awarded one credit for each completed step, with a maximum of four for 
models tested to a very high extent in all matrix element generators. More explicitly, the four steps are: 

1. Documentation [First credit] : A first credit is awarded if the implementation is documented to 
ensure traceability and reproducibility. Information about the model should be included in the 
FeynRules model file via the M$ModelInf ormation variable (See the FeynRules manual), 
and should contain a twofold information: 
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• All references to the original publications where the model was first presented should be 
included, to ensure that the Lagrangian, its field content and the relations between parameters 
can be traced back at any time. This also includes references to other codes involved in 
the writing of the model file, e.g., spectrum generators. In case the implemented model is 
incomplete or consists only in a theory fragment, this should be clearly stated. 

• The operating system (Linux distribution, Mac OS, ...) and the versions of M athematica 
and FeynRules (as well as all other codes, if relevant) used to write the model file must be 
provided in order to ensure reproducibility at later times. 

2. Basic theory sanity checks [Second credit] : A second credit is awarded for testing that the 
model satisfies the constraints imposed by quantum field theory, e.g., checking the hermiticity and 
gauge invariance of the implemented Lagrangian. If the Feynman rules for the model are known 
in the literature, they should be compared to the results obtained by FeynRules for this model. 
Furthermore, simple two-to-two cross sections and/or decay rates can be easily computed either by 
hand or by means of FeynArts/FormC ALC and the corresponding FeynRules interface and 
compared to known analytic results (if available). 

3. Testing one matrix element generator [Third credit] : After the theoretical sanity of the imple- 
mentation has been checked, a more detailed validation of the model can be performed by exporting 
it to a matrix element generator, both to exclude mistakes in the implemented model as well as to 
ensure that the implementation can be used in a reliable way with a given code. 

• Basic processes like Drell-Yan and Bhabba scattering can be used to check that the matrix 
element generator produces reliable results for the corresponding cross-sections, followed by 
a more systematic study of several two-to-two and/or two-to-three key processes of interest 
in this model. A particular emphasis should be brought to the running of the strong and/or 
electromagnetic couplings, as well as to the reproduction of known SM cross section results 
for the sectors of the model unaffected by the BSM physics. 

• In case an independent (partial) implementation of the model is available, a comparison of the 
two implementations should be performed. 

• The high-energy behavior of the model can be easily checked numerically to ensure that all 
gauge and unitarity cancellations take place in a numerically stable and reliable way. Note that 
for such cancellations to take place exactly at tree-level, it is required that all particles have 
zero width, i.e., all widths of all particles should be put to zero while performing these tests. 

• Some matrix element generators, like for example CalcHep, offer the possibility to choose 
between different gauges. In this case gauge invariance can be easily tested by running the 
same processes in different gauges. 

• The outcomes of the previous steps, i.e., the numerical results for the cross sections for all the 
tested processes, should be summarized and made public together with the model file(s) for 
future reference. Let us note that in order to ensure reproducibility, it is important to include 
the center-of-mass energy and all final state cuts, as well as the version number of the matrix 
element generator involved. 

4. Testing several matrix element generators [4th credit] : Since FeynRules offers the possibility 
to obtain implementations into several matrix element generators, it is very easy to repeat the 
previous steps for more than one program. In this way a very sensible validation of the model can 
be achieved, by exploiting for example the fact that different codes may work in different gauges. 
Again, to ensure reproducibility, the results should be summarized and made public together with 
the model. 
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Process 


MG (FR) MG (ST) 


CH (FR) CH (ST) 


SH (FR) SH (ST) 


e+e" ^ Zh^ 
W+W H+ H- 

UU —7- 

UU — )• U5U2 

J Y 1- ~ 

00 XiX2 

bi X2X3 
Zl ^ 1 2 It 
gW- d^ul 

77 xtXi 


8.787e-3 8.788e-3 
3.689e-2 3.686e-2 
2.123e-3 2.123e-3 
6.141e-l 6.142e-l 
9.556e-2 9.545e-2 
3.981e-2 3.971e-2 
1.712e-2 1.711e-2 
2.569e-l 2.566e-l 
6.250e-l 6.263e-l 
5.235e-2 5.235e-2 


8.787e-3 8.787e-3 
3.685e-2 3.685e-2 
2.123e-3 2.123e-3 
6.141e-l 6.141e-l 
9.556e-2 9.556e-2 
3.977e-2 3.977e-2 
1.712e-2 1.712e-2 
2.566e-l 2.566e-l 
6.257e-l 6.257e-l 
5.236e-2 5.236e-2 


8.788e-3 8.788e-3 
3.685e-2 3.685e-2 
2.123e-3 2.123e-3 
6.141e-l 6.141e-l 
9.555e-2 9.555e-2 
3.977e-2 3.977e-2 
1.712e-2 1.713e-2 
2.565e-l 2.565e-l 
6.257e-l 6.255e-l 
5.238e-2 5.238e-2 



Table 7: Cross sections for a selection of production processes in the MSSM scenario SPS la. The 
built-in MSSM implementation in CalcHep, MadGraph and Sherpa are denoted MG (ST), CH (ST) 
and SH (ST), respectively, while the FEYNRULES-generated ones are MG (FR), CH (FR) and SH (FR). 
The center-of-mass energy is fixed to 1200 GeV. 



The rating scheme we just described will be applied to the model database on the FeynRules website. 
The FeynRules model database offers to every user the possibility to make their model files accessible 
to the community. Every model is assigned a personal web page where the model can be described and all 
relevant files can be made public for download. By applying the described rating scheme to the model 
database it will be possible to progressively build a database of robust BSM model implementations that 
can be used in a large variety of different matrix element generators while still assuring traceability and 
reproducibility at any time. 

Let us conclude this section by illustrating this validation procedure on the example of the FeynRules 
implementation of the MSSM. After having implemented the model in FeynRules, we have compared 
the Feynman rules computed by FeynRules to those which can be found in the literature, both for the 
general MSSM p0ip02) and for a constraint MSSM where all the scalar mixings are neglected | |303p04) , 



and we have found agreement for all the vertices. Then, we have re-calculated all tree-level squark and 
gaugino hadroproduction helicity amplitudes in the case of general (and possibly complex) scalar mixing 
with the help of FeynArts/FormC ALC and the corresponding model file generated by FeynRules. 



The results have been compared to the analytical formulas given in Refs. |305 306 1 and we have found 
complete agreement. In order to validate the FEYNRULES-generated model files for the various matrix 
element generators, we have considered the very particular limit of the typical minimal supergravity point 



SPS la 1 307 1 and compared the cross section for 626 two-to-two processes in CalcHep, MadGraph 
and Sherpa to the results obtained by the built-in MSSM implementations in CalcHep, MadGraph 
and Sherpa. For all the tested processes we have found perfect agreement. Note that MadGraph 
and Sherpa work in unitary gauge exclusively, whereas CalcHep employs Feynman gauge for QCD 
processe^^ In this way we have demonstrated explicitly that our implementation is gauge invariant (at 
least in the strongly interacting sector). Furthermore, we have payed particular attention to weak boson 
scattering processes to ensure that all unitarity cancellation at high energy take place correctly. A very 
short selection of the results we obtained are shown in Table [7] The full list of tested processes, together 
with the results for the cross sections, is available from the FeynRules model database. 



^'Let us note that for the electroweak sector, CalcHep can in principle work both in unitary or Feynman gauge. At present, 
our implementation is performed in unitary gauge only for the electroweak sector. 



126 



20.4 RECENTLY IMPLEMENTED MODELS 

20.41 THE MINIMAL B- L MODEL 

In this section we discuss the implementation of the so-called "pure" or "minimal" B — L model (see 
Ref. ]308 | for conventions and references) that features vanishing mixing between the two U{1)y and 
U{\) B-L gauge groups. The classical gauge invariant Lagrangian of this model obeys the gauge symmetry, 

SU{3)c X SU{2)l X U{1)y x U{1)b-l , (43) 

and can be decomposed as £ = £y a/ + £« + £/ + £y . The non-Abelian field strengths in Cy m are the 
same as in the SM whereas the Abelian ones can be written as follows, 

^YM = - \f'>^'FI, , (44) 

where F^^/ = d^B,^ — d^B^j^ and F'^^j = — dyB'^. In this field basis, the covariant derivative reads, 

= 5^ + igsT'^G^^ + igT'^W^'^ + ig^YB^ + i{gY + g[YB-L)B'^ . (45) 

For the "pure" or "minimal" B — L model the condition g = holds, implying that there is no mixing 
between the B — L Z' and SM Z gauge bosons. 

The fermionic Lagrangian (where k is the generation index) is given by, 

3 

C-f = ^ QkLlfiF'f'qkL + i UkRltiFl^'ukR + i dkR^t^D^dkR + 
k=l 



+i hhltiD^lkL + i ekBlfj-D^ekB^ + i i'kR'y,iF>'^J^kRj , (46) 

where the charges of the fields are the usual SM and B — L charges (i.e., B — L = 1/3 for quarks and 
—1 for leptons with no distinction between generations, hence ensuring universality). The B — L charge 
assignments of the fields as well as the introduction of new fermionic right-handed heavy neutrinos (ur's) 
and a scalar Higgs field (x, charged +2 under B — L) are designed to eliminate the triangular B — L 
gauge anomalies and to ensure the gauge invariance of the theory (see Eq. (49 1), respectively. 

The scalar Lagrangian reads, 

Cs = (D^H)^ D^H + (Z?^x)^ D^X - V{H, x) , (47) 
with the scalar potential given by 

V{H,x) = -m^H^H - /i^ I X P +\i{H^Hf + A2 | X 1^ +\zH^ H \xW (48) 

where H and x are the complex scalar Higgs doublet and singlet fields, respectively. 
Finally, the Yukawa interactions are. 



Cy = -y% QjLdkRH - QjLUkRH - Vji, IjiekRH - y^^ IjL^^kRH - {vRfjVkRX + h-c , (49) 

where H = ia'^H* and i, j, k take the values 1 to 3, where the last term is the Majorana contribution and 
the others the usual Dirac ones. 

The implementation of the model into FeynRules is straightforward. Only the neutrino sector is more 
complicated and needs to be suitably rewritten to manifestly preserve gauge invariance. As a first step we 
rewrite Dirac neutrino fields in terms of Majorana ones using the following general substitution, 

D 1-75 ,1+75 

= — 7^ l^L H l^R , (50) 
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where is a Dirac field and i'l{R) its left (right) Majorana components. If we perform the substitution 



of Eq. ( 50 ) in the neutrino sector of the SM, we will have an equivalent theory formulated in terms of 



Majorana neutrinos consistent with all experimental constraints. Furthermore, from Eq. (49 1 we obtain 
the neutrino mass matrix, 

M=( (51) 



with ^ 

mD = ^v, M = V2y^^x, (52) 

where x is the Vacuum Expectation Value (VEV) of the x field. This matrix can be diagonalized by a 
rotation about an angle a,y, such that, 

tan2a^ = ■ (53) 

For simplicity we neglect the inter-generational mixing so that neutrinos of each generation can be 
diagonalized independently. We also require that the neutrinos be mass degenerate. Thus, vl,r can be 
written as the following linear combination of Majorana mass eigenstates ui^^, 

'^L \ ^ f cos au -sinai. \ / ui \ 
UR ) \ smay cos ay ) \ fh ) ' 

Neutrino mass eigenstates will be called f; and Vh, the former being SM-like. With a reasonable choice of 
Yukawa couplings, the heavy neutrinos can have masses m,y^ ~ 0(100) GeV ^ M^/. 

The last subtle point is the way the Lagrangian has to be written, in particular the Majorana-like Yukawa 



terms for the right-handed neutrinos (the last term in Eq. (49 0- In order to explicitly preserve gauge 



invaiiance, this term has to be written, in two-component notation, as. 



-y ^ — 1^ — + h.c. , (55) 



where v is the Dirac field of Eq. (50 1, whose Majorana components vl,r mix as in Eq. (54 1. 

The implementation of the minimal B — L model in FeynRules was validated by comparing against 
an independent implementation in CalcHep performed by means of the LanHep package [308|. We 
compared the matrix elements squared in MadGraph and CalcHep at various randomly -generated 
phase space points for 284 two-to-two processes and we found perfect agreement in all cases. Let us 
note that the cun^ent FeynRules implementation was done in unitary gauge, whereas the LanHep 
implementation allows for both Feynman and unitary gauges, and hence we demonstrated explicitly the 
gauge invariance of our implementation. Furthermore, because of mixing in the scalar sector, we paid 
particular attention to the unitarity cancellation in weak boson scattering to ensure that the implementation 
has the correct high-energy behavior. A selection of the results obtained during the validation procedure is 
shown in Table [8] A more extensive list, together with the corresponding FeynRules model file, can be 
obtained from the FeynRules model database. 



20.42 THE NEXT-TO-MINIMAL SUPERSYMMETRIC STANDARD MODEL 

The Next-to-Minimal Supersymmetric Standard Model (NMSSM) is a viable extension of the MSSM in 
the Higgs sector of the model (see, e.g., the references in [309J). The main motivation for the NMSSM 
is a possible solution to the so-called jj, problem, the fact that the supersymmetric parameter fi in the 
superpotential is dimensionful and should in principle be of the order of the SUSY breaking scale. A 
working electroweak symmetry breaking demands this parameter, however, in the ball park of a few 
hundred GeV. The NMSSM addresses this problem by enlarging the particle spectrum by a single chiral 
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Process 


MG (FR) 


CH (FR) CH (LH) 


e+e" i^iiuii 
e+e" mi^hi 
e+e" J^/iiz^hi 
e+e- ^ ZZ' 

w^+ir- ^ w+w- 


5.681e+01 
5.315e-12 
2.697e-02 
1.570e-01 
1.420e+03 


5.687e+01 5.687e+01 
5.307e-12 5.307e-12 
2.691e-02 2.691e-02 
1.569e-01 1.569e-01 
1.421e+03 1.421e+03 



Table 8: Cross section (in pb) for a selection of processes in the minimal B — L model corresponding to 
^ = 4 TeV. A pT cut of 20 GeV was applied to all final state particles. MG (FR) and CH (FR) refer to 
the FeynRules implementations in MadGraph and CalcHep respectively, whereas CH (LH) refers 
to the LanHep implementation of Ref. |308| . 



superfield S, being a singlet under the SM gauge group. When the singlet acquires a vev, it generates an 
effective /x parameter. The quartic term for the Higgs-potential is generated via an F term from a cubic 
superpotential term. 

In the following, we will use the conventions of the SUSY Les Houches Accord 2 |43j44j|310j. This is 



a quite general approach, but neglecting possible CP, -R-parity, or flavor violation. The superpotential and 
the soft-breaking terms of the NMSSM are given by 

Wnmssm = Wmssm — <^ab^SHlH\ + -^i^S^ + fJ-'S'^ + CfS , 
Vsoit = Vmssm + mi +( - eabXAxSH^H'^ 

+ ^kA^S^ + m'|S2 + + h.c.) . (56) 

In the Whizard implementation, we have ommited all non-Zs symmetric terms in the superpotential, as 
well as the corresponding soft terms, while the FeynRules model includes the most general superpoten- 
tial. The field content of the NMSSM is almost the same as for the MSSM, except for an additional scalar 
and pseudoscalar Higgs boson, denoted by and A2, as well as a fifth neutralino, ^5, coming from the 



additional singlino component. In addition to the conventions from Ref. 1 43 44 310] to have left-right 
sfermion mixing in all three generations, the FeynRules implementation is a bit more general, allowing 
for inter-generational sfermion mixings. 



The tests that we have performed follow closely the strategy described in Section 20.3 We have 
compared the cross section for 736 processes in Whizard and in the FEYNRuLES-generated model files 
for CalcHep and MadGraph. A very short selection of results for processes including Higgses and 
neutralinos can be found in the Table |9] The full list of processes which we have investigated, together 
with the results for the cross sections and the selected benchmark point, will be made available in the 
FeynRules model database and on the Whizard home page. For the Whizard implementation it was 
checked that it yields the correct MSSM hmit |311[ . 



20.43 THE MINIMAL R-SYMMETRIC SUPERSYMMETRIC STANDARD MODEL 

The minimal /^-symmetric supersymmetric standard model (MRSSM) extends the usual MSSM by 
introducing a global continuous U{1)r symmetry [312]. All the quark and lepton supermultiplets carry an 
i?-charge +1, whereas the Higgs and gauge supermultiplets carry an i?-charge 0. This implies that gaugino 
Majorana mass terms are forbidden, as well as the //-term in the superpotential. Since supersymmetry 
must be broken, the gauginos must be massive and we need a new mechanism to introduce gaugino mass 
terms. This is achieved by including for each gauge supermultiplet an additional chiral supermultiplet, 
transforming in the adjoint representation of the gauge group and with an i?-charge 0, making it possible 
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Process 


wo (ST) 


MG (FR) 


CH (FR) 


6+ e — ^ 


xlxl 


4.882e-3 


4.884e-3 


4.886e-3 




Xl xl 


3.476e-4 


3.470e-4 


3.469e-4 


w- Z ^ 


HlH- 


4.211e-3 


4.213e-3 


4.210e-3 


zz 




6.505e-3 


6.512e-3 


6.513e-3 


w+w- 


xUI 


2.055e-l 


2.056e-l 


2.056e-l 



Table 9: Cross sections for a selection of production processes in the NMSSM. The built-in MSSM 
implementation in Whizard is denoted WO (ST) while the FEYNRULES-generated ones are MG (FR) 
and CH (FR). The center-of-mass energy is fixed to 3000 GeV. 



to write down Dirac mass terms for the gauginos. As a consequence, at variance with the usual MSSM 
scenario where the neutralinos and gluinos are expected to be Majorana fermions, the gluinos and the 
weak inos in the MRSSM are four-component Dirac fermions, leading to a new phenomenology for 
supersymmetric models. In the following however, we only review the SU{3)c sector of the model. The 
electroweak sector is conceptually similar, but more involved because of the mixing in the electroweak 



gauge sector and we refer the interested reader to Ref. |313fpl6) | for further details on the MRSSM and 
its phenomenology. 

The SU (3)c sector of the MSSM describes the gauge interactions between the quark superfields 
Q = {qL,qL,FQ), U = {u'j^,ul,Fu) undD = (Jjj, d^, Fd) and the gluon superfield G = {\,g^,D). In 
the MRSSM this sector is augmented by an additional chiral supermultiplet <^g = {(f), Fg) transforming 
as an octet. In order to give the gluino a mass in an /^-symmetric way, the soft supersymmetry breaking 
part of the Lagrangian contains a Dirac mass term coupling the Weyl components of the superfields G and 

C,oit 3 Md (A'^ • + r • A") + Md D^ct^a + (l^i) ■ (57) 

The physical gluino field then corresponds to the four-component Dirac fermion g = (x, A)^. The 
interactions between the superfields are described by the usual supersymmetric gauge interactions. 
Integrating out auxiliary fields and omitting all terms involving quark fields left unchanged with respect to 
the usual MSSM, the gauge interactions can be described by the Lagrangian, 



+ -Ml {4>a + <Ai)' + ^Ml {ct>a + 4f + -M^s (0, - 4>lf + Md (A" ■x'^ + T- ^ 



+ Qs Md {(^a + cl>l) qiT^L - u^rT^ur - d'j.T'^dR (58) 



„2 . 

^ r,"! _ ~t rpa~ „ _ Jt T^a J, a fabc . t 



2 ^qlT''qL-u'^nR-d'j^T''dR-if ^^^^^ 
+ iV2gsr'' (^x^-A'^ + ^feA^-r 
where denotes the covariant derivative in the adjoint representation of SU (3)c, 

D^ = d^-ig,F-g''^, (59) 

and {F°-)^^ = —if°-^'^ denote the generators of the adjoint representation. 

We implemented the full MRSSM in FeynRules in its most general form, i.e., keeping generic and 
possible complex mixing between particles. The validation of the implementation is currently on-going. 
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20.5 CONCLUSION 

In this summary report we presented the activities of the FeynRules working group triggered by 
discussion at the Les Houches workshop. We argued that the FeynRules framework does not only 
provide a natural framework where BSM models can easily be developed and implemented, but they can 
also be validated to an unprecedented level by exploiting the fact that the model can be exported to various 
matrix element generators in an automated way. We proposed a rating scheme based on a four credit 
points system by which FeynRules models can be classified with respect to their level of validation, 
in the perspective of building a database of robust and thoroughly validated implementations, and we 
illustrated the power of this proposal on several newly implemented and validated models, in particular 
the minimal B — L model, well as several supersymmetric extensions of the Standard Model. 
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